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1 Introduction 

Analogy is a computational process in which a given situation is understood by bringing to bear 
knowledge of previous, similar experiences. Analogy may be used to guide reasoning, to gener- 
ate conjectures about an unfamiliar domain, or to generalize several experiences into an abstract 
schema. Consequently, analogy is of great interest to both cognitive psychologists and artificial 
intelligence researchers. Psychologists wish to clarify the mechanisms underlying analogy in order 
to understand human learning and reasoning. Artificial Intelligence researchers wish to emulate 
analogical processing on computers to produce more flexible reasoning and learning systems. 

This paper describes the Structure-Mapping Engine (SME), a program built to explore the com- 
putational aspects of Gentner's Structure-Mapping theory of analogical processing [23,25]. SME has 
been used both as a cognitive simulation of human analogical processing and as a component in a 
larger machine learning system. 

SHE is both flexible and efficient. It constructs all * insistent ways to interpret a potential 
analogy and does so without backtracking. SME provides a "tool kit" for constructing matchers 
consistent with the kinds of comparisons sanctioned by Gentner's theory. A matcher is specified 
by a collection of rules, which indicate what things might match and estimate how strongly these 
matches should be believed. The program uses these estimates and a novel procedure for combining 
the local matches constructed by the rules to efficiently produce and evaluate all consistent global 
matches. This efficiency and flexibility makes the matching algorithm promising for exploration of 
both the space of cognitive models and the computational aspects of analogy for AI. 

Cognitive simulation studies can offer important insights for understanding the human mind. 
They serve to verify psychological theories and force cue to pin down those aspects of a theory 
which might otherwise be left unspecified. They also offer unique opportunities to construct ide- 
alized subjects, whose prior knowledge and set of available processes is completely known to the 
experimenter. Unfortunately, cognitive simulation programs tend to be complex and computation- 
ally expensive (c.f. [1,59]). Complexity can obscure the relationship between the theory and the 
program. Typically there are many design decisions in building a program, and if one ca >ot assign 
credit to them when analyzing results then it can be hard to see where the performance is really 
coming from. Often it is desirable to explore a space of similar architectures to determine what the 
consequences of particular design decisions are and to model particular performance in detail. Such 
explorations are very difficult if the major way to change the program's operation is surgery on the 
code. Being computationally expensive m »ans perfeming fewer experiments, and thus exploring 
fewer possibilities. While there have been several importa^ AI programs that study computational 
aspects of analogy (e.g., [3,65,66]), they were not designed to satisfy the above criteria. 

Recently there has been a plethora of approaches to analogy in AI (as we review later), but 
surprisingly little progress so far. Often papers describe programs that work on only a handful of 
carefully chosen examples, and do not specify the algorithms in a replicable fashion. We believe 
the reason so little progress has been made is that analogy is a complex problem, and that the 
appropriate decomposition is critical. Without a good decomposition, it is easy to tackle several 
semi-independent problems at once, or an underconstrained aspect of the problem, and become 
lost in the space of possible mechanisms. Our decomposition, described in the next section, is 
psychologically motivated. Roughly, SME focuses on the mapping aspect of analogy, leaving the 
access and application aspects to future studies. The power of the program that results, and its 
success on a wide variety of examples (over 40 as of this writing), provides additional evidence that 
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the decomposition is a good one. 

This paper examines the architecture of the Structure-Mapping Engine and how it has been 
used for machine learning and cognitive simulation. First, we review Gentner's Structure-Mapping 
theory and some of the psychological evidence for it. Next we discuss the organization of SME, 
including the knowledge representation conventions and the details of the algorithm. After a 
complexity analysis, we then illustrate SME's operation on several examples drawn from machine 
learning and cognitive simulation studies. Related work in both AI and psychology is reviewed 
next, followed by a discussion of further issues raised by this design. 

2 Structure-Mapping Theory 

The theoretical framework, for this research is Gentner's Structure-Mapping theory of rnalogy 
[23,24,25,26,27,28]. Structure-Mapping describes the set of implicit rules by which people interpret 
analogy and similarity. The central idea is that an analogy is a mapping of knowledge from one 
domain (the base) into another (the target) which conveys that a system of relations known to hold 
in the base also holds in the target. The target objects do not have to resemble their corresponding 
base objects. Objects are placed in correspondence by virtue of corresponding roles in the common 
relational structure. 

This structural view of analogy is based on the intuition that analogies are about relations, 
rather than simple features. No matter what kind of knowledge (causal models, plans, stories, 
etc.), it is the structural properties (i.e., the interrelationships between the facts) that determine 
the content of an analogy. For example, consider the heat Bow and water Sow situations shown in 
Figure 1. These situations are thought to be analogous because they share the complex relationship 
known as "flow" . In each, we have a rough picture of something Bowing downhill, from a source 
to a destination. We prefer to ignore the appearances and even specific defining properties of the 
objects, such as the fact that water and coffee are both liquids. Indeed, focusing on these attributes 
tends to confuse our picture of the analogy. 

2.1 Constraints on Analogy 

An important preliminary: We define the order of an item in a representation as follows: Objects 
and constants are order 0. The order of a predicate is one plus the maximum of the order of its argu- 
ments. Thus GREATER- THAN (x , y) is first-order if x and y are objects, and CAUSE [GREATER- THAN (x , y ) , 
BREAK (x)] is second-order. Examples of higher-order relations include CAUSE and IMPLIES. This 
definition of order should not be confused with the standard definition. 1 Essentially, we use this def- 
inition of order to indicate how deep the structure is below an item. Notice that intricate arguments 
with many layers of justifications will give rise to representation structures of high order. 

Given collections of objects {b t }, {t;} in the base and target representations, respectively, the 
tacit constraints on the analogical mapping M can be characterized as follows: 

1. Objects in the base are placed in correspondence with objects in the target: 
M: b; -> t t - 

l Under the standard definition, a logic is first-order if variables only range over objects and tecCiid-order when it 
permits variables to range over predicates as well. 
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WAJtMCOMff 

Figure 1: Two physical situations involving flow. 



2. Isolated object descriptions are discarded unless they are involved in a larger relational struc- 
ture. 

e.g. RED(b,) RED(t*) 

3. Relations between objects in the base tend to be mapped across: 

e.g. COLLIDED. by) — ♦ COLLIDE (t,*,ty) 

4. The particular relations mapped are determined by st/8tcmaticity y as defined by the existence 
of higher-order constraining relations which can themselves be mapped: 

e.g. CAUSECPUSHCbi.b^O^COLLIDECby,^)] => 
CAUSECPOSHCt^t^O.COLLIDECty.tk)] 

For example, consider the analogy between heat-flow and water-flow. Figure 2 shows what a 
learner might know about the domains pictured in Figure 1. In order to comprehend the analogy 
"Heat is like water* a learner must do the following (although not necessarily in this order): 

1. Set up the object correspondences between the two domains: 

heat water, pipe metal bar, beaker -+ collee, vial ice cube 

2. Discard object attributes, such as LIQUID (water). 

3. Map base relations such as 

GREATER-THAN [PRESSURE (beaker) , PRESSURE (vial)] 
to the corresponding relations in the target domain. 

4. Observe systematicity: i.e., keep relations belonging to a systematic relational structure in 
preference to isolated relationships. In this example, 
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WATER-FLOW 



HEAT FLOW 




FLO W(btaktr,vttl, water, pipt) 





FLOW(coffw.ice cube>.eat.baf) 



LlQtfD(*it«r) 



FLAT-TOP(wattr) 



FLAT-TOP(coffw) 



Figure 2: Simplified water flow and heat flow descriptions. 



CAUSE (GREATER-THAN [PRESSURE (beaker) , 
PRESSURE(vial)] , 
FLQW(beaker, vial, water, pip*)) 



is mapped into 



CAUSE (GREATER-THAN [TEMPERATURE (coffee) , 

TEMPERATURE (ice cube)] , 
FLOW(coffee, ice cube, heat, bar)) 

while isolated relations, such as 

GREATER - THAN [D I AMETER ( be aker ) , DIAMETER(vial) ] 

are discarded. 

The aystematicity principle is central to analogy. Analogy conveys a system of connected knowl- 
edge, not a mere assortrr ent of independent facts. Preferring systems of predicates that contain 
higher-order relations with inferential import is a structural expression of this tacit preference for 
coherence and deductive power in analogy. Thus, it is the amount of common, higher-order re- 
lational structure that determines which of several possible matches is preferred. For example, 
suppose in the previous example we were concerned with objects differing in specific heat, such as 
a metal ball-bearing and a marble of equal mass, rather than temperatures. Then DIAMETER would 
enter the mapping instead of (or in addition to) PRESSURE, since DIAMETER affects the capacity of 
a container, the analog to specific heat. 

2.2 Other types of similarity 

In addition to analogy, Structure-Mapping theory defines several other kinds of similarity. As we 
have seen, in analogy only higher-order relations are mapped. Aspects of object descriptions which 
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play no .ole in the relational structure are ignored. By contrast, in literal similarity both relational 
predicates and object-descriptions are mapped. Literal similarity typically occurs in within-domain 
comparisons, where the objects involved look alike as well as act alike. An example of a literal 
similarity is the comparison *Kool-Aid is like water." 2 In mere-appearance matches, it is primarily 
the object-descriptions which are mapped, as in the metaphor 

"Her eyes were like the deep blue of a summer sky." 

A fourth kind of mapping is the abstraction mapping. Here, the entities in the base domain 
are variables, rather th^a objects. Few, if any, attributes exist that do not contribute to the base's 
relational structure. The result of an abstraction matching is very close to the instantiation of a 
rule. The difference is that only entities may be variables, whereas in many pattern-directed rule 
system* predicates may be used in substitutions as well. 

It should be clear that Structure-Mapping neither subsumes unification, nor is subsumed by it. 
For example, the pair of statements 

(CAUSE (FLY PERS0H1) (FALL PERS0H1)) 
(CAUSE (FLT PERS0N2) (FALL PERS0H2)) 

could be part of an analogy, with PERS0H1 being mapped to PERS0N2, but these two statements do 
not unify since PERS0N1 and PERS0N2 are distinct constants. Conversely, 

(CAUSE (?X PERS0N1) (FALL PERS0H1)) 
(CAUSE (FLT ?Y) (FALL 11)) 

will unify, assuming ? indicates variables, with the substitutions: 

11 <- FLY 

?Y <- PERS0N1 

?Z 4- PERSON 1 

However, since Structure-Mapping treats variables as constants, these statments fail to be 
analogous in two ways. First, FLY and ?! are treated as distinct relations, and thus cannot match. 
Second, ?Y and ?Z are considered to be distinct entities, and thus are forbidden to map to the same 
target item (i.e., PERSONl). 

2.3 Subprocesses in analogy 

Structure-Mapping decomposes analogical processing into three basic stages ([29,22,26]): 

1. Access: Given a current target situation, one must first retrieve from long-term memory 
knowledge of another description, the base, which is analogous or similar to the target. 

2. Mapping: This stage establishes correspondences between the base and target. Potentially, 
there is additional knowledge in the base that can be transferred to the target. This additional 
knowledge is the 3et of candidate inferences sanctioned by the analogy. 

3 Notice that our characterixation of literal similarity is still structural, and thus is differs from other psychological 
approaches (e.g., (56]). 
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3. Evaluation and Use: There are two kinds of criteria for evaluating the quality of a match. The 
structural criteria include the number of similarities and differences, the degree of structural 
similarity involved, and the amount and type of new knowledge or insight the analog) provides 
via the candidate inferences. The second kind of criteria concerns the validity of the match and 
the inferences it sanctions. The inferences must be checked againct current world knowledge 
to ensure that the analogy at least makes sense, and may require additional inferential work 
to refine the results. Detailed characterization of the validity criteria lie outside the range of 
the Structure-Mapping theory. 

The Structure-Mapping Engine emulates the mapping stage of analogy and provides a struc- 
tural, domain-independent evaluation of the match. While we believe it can be used to model 
access, and provides useful results for accounts of evaluation and u&e (see [13,14]), we will ignore 
these issues for most of this paper. 

2.4 Empirical evidence 

Although the focus of this paper is on computational modeling, one set of psychological findings is 
particularly relevant. Empirical psychological studies have borne out the prediction that system- 
aticity is a key element of people's implicit rules for analogical mapping. Adults focus on shared 
systematic relational structure in interpreting analogy. They tend to include relations and omit 
attributes in their interpretations of analogy, and they judge analogies as more sound and more apt 
if base and target share systematic relational structure (23,29,30). In developmental work, it has 
been found that eight-year olds (but not five-year olds) are better at performing difficult mappings 
when the base structure is systematic [31]. 

3 The Structure-Mapping Engine 

A simulation of Gentner's theory has been implemented in the Structure-Mapping lgine (SME). 
Given descriptions of a base and target, SME constructs all structurally consistent mappings between 
them. The mappings consist of pairwise matches betw*.3n predicates and entities in the base 
and target, plus the set of analogical inferences sanctioned by the mapping. SME also p^vides 
a structural evaluation score for each mapping, allowing easy selection of the "best" moping, 
according to the rules of systematicity and consistency. 

Importantly, SME is not "haid wired" , but provides a testbed for implementing matchers consis- 
tent with Gentner's Structure-Mapping theory. In addition to analogy, SME can be used to simulate 
the other comparisons sanctioned by the Structure-Mapping theory. Such matchers may also serve 
a valuable role in reasoning or learning programs. For example, given the descriptions of water 
flow and heat flow shown in Figure 2, SME would offer several alternative interpretations for this 
potential analogy. In one interpretation, the central inference is that water flowing from the beaker 
to the vial corresponds to heat flowing from the coffee to the ice cube. Alternatively, one could 
map water to coffee, since they are both liquids. This is an interpretation provided by SME, but 
with a lower evaluation score. A learning system could use the structural evaluation to select the 
explanation most likely intended by the human teacher. 

This section describes the SME algorithm in sufficient detail to allow replication. We start by 
defining some simple conventions for knowledge representation, since these conventions are essential 
to understanding the algorithm. 
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3.1 Representation conventions 

We make as few representational assumptions as possible so that SME remains domain-independent. 
However, a few conventions are necessary. We use a typed (higher-order, in the standard sense) 
predicate calculus to represent facts. The constructs of this language are: 

Entities: Individuals and constants. 

Predicates: There are three types: functions, attributes, and relations. Each is described below. 
Dgroup: A description group is a collection of entitie. and facts about them, considered as a unit. 

We examine each construct in turn. 

3.1.1 Entities 

Entities are logical individuals, i.e., the objects and constants of a domain. Typical entities include 
physical objects, their temperature, and the substance they are made of. Primitive entities are 
declared with the del Entity form: 

(delEntity (name) 
[ type (EntityType)] 
[: const ant? {t I nil}] ) 

Since our language is typed, each entity type nan be declared as a subtype of an existing type 
using the :type option. For example, we might have 

(delEntity star :type Inanimate) 
(delEntity Sun :type star) 

to say that stars are inanimate objects, and our Sun is a particular star. Constants are declared 
by using the : constant? option, as in 

(delEntity zero :type number : constant? t) 

3.1.2 Predicates 

We first describe the three types of predicates, and then show how they are declared. 

Functions Functions map one or more entities into another entity or constant. For example, 
(PRESSURE piston) maps the physical object piston into the quantity which describe^ its 
pressure. We treat functions whose range are truth values as relations (see below), rather than 
functions. Consequently, Structure-Mapping treats functions differently from other types of 
predicates. It allows substitution of functions to acknowledge their role as an indirect way of 
referring to entities. 

Attributes A. attribute describes some property of an entity. Examples of attributes include RED 
and CIRCLE. It is well-known that a combination of a function and a constant is logically 
equivalent to an attribute. For example, 
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(RED BlockA) 

(« (COLOR BlockA) RED) 

and 

(SQUARE BlockA) 

(* (SHAPE BlockA) SQUARE) 

are logically equivalent. However, these two forms do not behave identically under Structure- 
Mapping. We assume that a reasoner has a particular piece of attribute information rep- 
resented in one form or another, but not both, at any particular time (We defer detailed 
discussion of this issue until Section 6.1). 

Relations Like attributes, relations range over truth values. However, the arguments to rela- 
tions can be other predicates as woll as entities. Examples of relations include CAUSE and 
GREATER- THAN. 

The del Predicate form declares predicates. It has several options, due in part to the existence 
of several types of predicates: 

(defPredicate (Name) (ArgxtmentDeclarations) {PredicateType) 
: expression- type (De f intdTypt) 
[: commutative? {t I nil}] 
[: lexpr? {t 1 nil}] ) 

(PredicateType) is either function, attribute, or relation, according to what kind of predicate 
(Name) is. The (Argument Declarations) allows the arguments to be named and typed. For 
example, the declaration: 

(def Predicate CAUSE ((antecedent event) (consequent event)) relation) 

states that CAUSE is a two-place relational predicate. Its arguments are called antecedent and 
consequent, both of type event. The names and types of arguments are for the convenience of the 
representation builder, and are not currently used by SME. However, the type of predicate is very 
important to the algorithms, as we will see below. 

The optional declarations : commutative? and :lexpr? provide SME with important syntactic 
information. : commutative? indicates that the predicate is commutative, and thus the order of 
arguments is unimportant when matching. : lexpr? indicates that the predicate can take any 
number of arguments 3 . Examples of commutative lexpr predicates include AND, OR, and SUM. 

3 The term derives from early lisp dialects which classified functions that took an arbitrary number of arguments 
as lexprs. 
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3.1.3 Facts and Dgroupa 

For simplicity, predicate instances are called facts. Notice we include terms corresponding to 
function applications as facts. The reason is that in discussing SME's operation we will often need 
to /efer to them by a short name. We call the predicate of the fact its functor. 

A Description Group, or Dgroup, is a collection of primitive entities and facts concerning them. 
Dgroups are defined with the del Description form: 

(defDescription {DescriptionNarne) 

entities ( {Entity i) , {Entity) {Entity { )) 

facts ({FactDeclarations))) 

where (FactDeclarations) take the form 

(fact) or 

({fact) rname (FactName)) 

For example, the description of water flow depicted in Figure 2 was given to SME as 

(defDescription simple-water-f low 

entities (water beaker vial pipe) 

facts (((flow beaker vial water pipe) :name wflow) 

((pressure beaker) :name pressure-beaker) 

((pressure vial) rname pressure- vial) 

((greater pressure-beaker pressure -vial) rname >pressure) 
((greater (diameter beaker) (diameter vial)) :name >diameter) 
((cause >pressure wflow) rname cause-flow) 
(flat-top water) 
(liquid water))) 

In addition, the description of heat flow depicted in Figure 2 was given to SME as 

(defDescription simple-heat-flow 

entities (coffee ice-cube bar heat) 

facts (((flow coffee ice-cube heat bar) rname hflow) 

((temperature coffee) rname temp-coffee) 

((temperature ice-cube) rname temp-ice-cube) 

((greater temp-coffee temp-ice-cube) rname >temperature) 

(flat-top coffee) 

(liquid coffee))) 

Notice that each fact does not need to be declared separately; SME will automatically create and 
name facts corresponding to (diameter beaker) and (diameter vial). 

We will refer to the facts and entities in a d*roup collectively as items. To describe the SME 
algorithm we need some terminology to express the structural relations between items. These 
expressions are rooted directed acyclic graphs, so we adopt standard graph-theory terminology. 
First, the offspring of a fact are its arguments. Entities have no offspring. An item I\ which is in 
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the transitive closure (arguments of arguments, etc.) of another item h b said to be a descendant 
of Jj, while ij is said to be an ancestor of Ji. An item with no ancestors is called a root The 
term Tree (I) refers to the subtree starting at J. Notice that in Structure-Mapping the order of a 
predicate is inversely proportional to its depth. 

3.2 The SME Algorithm: Overview 

Given descriptions of a base and a target, represented as Dgroups, SME builds all structurally 
consistent interpretations of the comparison between them. Each interpretation of the match is 
called a global mapping, or Gmap 4 . Gmaps consist of three parts: 

1. A Global Match: A set of pairwise matches between the facts and entities of the two Dgroups. 

2. Candidate Inferences: A set of new facts which the comparison suggests holds in the target 
Dgroup. 

3. A numerical evaluation score based on the Gmap's structural properties. 

Following the Structure-Mapping theory, only purely structural criteria are used to construct 
and evaluate the mappings. SME has no other knowledge of either base or target domain. Neither 
rules of inference nor even logical connectives themselves are "wired in 35 to the algorithm. Each 
candidate inference must be interpreted as a surmise, rather than a valid conclusion. The evaluation 
score reflects the aesthetics of the particular type of comparison, not validity or potential usefulness. 
Testing the validity of candidate inferences and determining the utility of a match are left to other 
modules, as described in Section 2. This decomposition leads to strikingly good computational 
performance — better than algorithms which use additional criteria, as we argue below in Section 
5. 

Match rules specify what pairwise matches are possible and provide local measures of evidence 
used ii> computing the evaluation score. These rules cje the key to SME's flexibility. To build a 
new matcher one simply loads a new set of match rules. This has several important advantages. 
First, we can simulate all of the types of comparisons sanctioned by Structure-Mapping theory 
with one program. Second, we could in theory "tune" the rules if needed to simulate particular 
kinds of human performance (although, importantly, this flexibility has not been needed so far!). 
Third, we can also simulate certain other analogy systems (including [35,65], as described below) for 
comparison purposes. In our experiments using SME, we currently use three rule sets, depending on 
the phenomenon being investigated. One set of rules focuses on object descriptions and is called the 
mere-appearance rules. In addition, the analogy rule set prefers relations, while the literal similarity 
rules look at both relations and object descriptions. Because the literal similarity rules are the 
most inclusive, we will be using them in this section to explain the system. Subsequent examples 
will then focus on analogy matches. 

The SME algorithm is logically divided into four stages. 

1. Local match construction: Finds all pairs of ({Baseltem), (Targetltem)) that potentially can 
match. A Match Hypothesis is created for each such pair to represent the possibility that this 
local match is part of a global match. 

*The definition of Gmap is Inspired in part by de Kleer's work on Assumption-based Truth Maintenance, although 
we do not use an ATMS in the actual code. The idea of combining local solutions by constructing maximally consistent 
sets is analogous to the process of interpretation construction in an ATMS. As explained below, we also find bit-vectors 
a useful implementation technique for carrying out set operations needed to maintain structural consistency. 
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2. Gmap construction: Combines the local matches into maximal consistent collections of cor- 
respondences. 

3. Candidate inference construction: Derives the inferences suggested by each Gmap. 

4. Match Evaluation: Attaches evidence to each local match and uses this evidence to compute 
structural evaluation scores for each Gmap. 

We now describe each computation in detail, using a simple example to illustrate their operation. 
3.2.1 Step 1: Local match construction 

Given two dgroups, SME begins by finding potential matches between items in the base and target 
(see Figure 3). What can match is specified by match constructor rules, which take the form: 

(MHCrale ({Trigger) {IBaseVariable) {?TargctVariablc) 
[:test {TestForm)]) 

{Body)) 

There are two types of constructor rules, each indicated by a different value for {Trigger). The 
first type of rule is indicated by a : filter trigger. These rules are applied to each pair of base and 
target facts, executing the code in {Body). If the rtest option is used, {TestForm) must return 
true for the body to be run. For example, to state that a fact in the base may match a fact in the 
target whose functor is identical, we write: 

(MHCrule (:f liter ?b ?t :test (equal (fact-functor ?b) 

(fact-functor ?t))) 

(install-HH ?b ?t)) 

The second type of rule is indicated by a trigger of : intern. These rules are run on each 
match hypothesis as it is created. Typically they create match hypotheses between any functions 
or entities which are the arguments of the facts joined by the match hypothesis that triggered the 
rule. 

Currently we use three sets of rules, one for analogy, one for literal similarity, and one for mere 
appearance matches. The current literal similarity rule set uses only three match constructor rules. 
One rule is the filter rule shown above. The other two are intern rules, listed in Appendix A. The 
content of the first intern nxle is, roughly, 

"If the match hypothesis concerns two facts, then create a match hypothesis between 
any corresponding arguments that are both functions or entities." 

The second intern rule is a specialization of the first that runs only on commutative predicates 
(i.e., the a corresponding arguments" condition is removed). The filter rule suffices to b^ild match 
hypotheses between facts; the intern rules introduce the match hypotheses between entities and 
functions sanctioned by these fact matches. 

The rules for the other types of comparisons are similar. Tae analogy rules only create matches 
between attributes when they are part of some higher-order structure. The mere appearance rule 
set completely ignores higher-order structure. All three rule sets are listed in Appendix A. 
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- MH between predicates 



A - MH between entities (Enup) 

Figure 3: Local Match Construction. The water flow and heat flow descriptions of Figure 2 have 
been drawn in the abstract and placed to the left and right, respectively. The objects in the middle 
depict match hypotheses. 



We will need some conventions for talking about match hypotheses. We denote the hypothesis 
that b{ and tj match by MHfatj). When no ambiguity will result, we will simply say MH. The 
match hypotheses for a particular run of SME form a directed acyclic graph, with possibly many 
roots. We will use the same terminology to refer to the structural properties of graphs of match 
hypotheses (offspring, descendants, ancestors, root) as we use for describing items in Dgroups. 



Example: Simple analogy between heat and water In this example we will use the literal 
similarity rule set, rather than true analogy, in order to better illustrate a variety of details of the 
algorithm. The result of running these rules on the water flow and heat flow dgroups of Figure 2 is 
shown in Figure 3 (see also Figure 4). Each match hypothesis locally pairs an item from the base 
dgroup with an item from the target dgroup. 

There are several important things to notice in Figure 4. First, there can be more than one 
match hypothesis involving any particular base or target item. Here, TEMPERATURE can match with 
both DIAMETER and PRESSURE, since there are corresponding matches between the GREATER- THAN 
facts in both dgroups. Second, note that, with the exception of functions, predicates must match 
identically. Entities, on the other hand, are matched on the basis of their roles in the predicate 
structure. Thus while TEMPERATURE can match either PRESSURE or DIAMETER, IMPLIES cannot 
match anything but IMPLIES. This distinction reflects the fact that functions are often used to 
refer to objects and other types of entities, which are fair game for substitution under analogy. 
Third, not every possible correspondence is created. We do not, for example, attempt to match 
TEMPERATURE with water or heat with beaker. Local match6s between entities are only created 
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WATER FLOW HEAT FLOW 

Cause 




Liquid - 
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Figure 4: Water Flow / Heat Flow Analogy After Local Match Construction. The lines connecting 
water Sow and heat flow items depict match hypotheses. Match hypotheses between entities are 
called Emaps (e.g., the link between beaker and coffee). Match hypotheses which are not descended 
from others are called roots (e.g., the links between the GREATER predicates and the link for the 
predicate FLOW). Each mapping receives a local evaluation score during subsequent processing, as 
described below 
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when justified by some other identity. This significantly constrains the number of possible matches 
in the typical case. 

3.2.2 Step 2: Global Match Construction 

The second step in the SME algorithm combines local match hypotheses into collections of global 
matches (Gmaps). Intuitively, each global match is the largest possible set of match hypothsses 
that depend on the same one to one object correspondences. 

More formally, Gmaps consist of maximal, structurally consistent collections of match hypothe- 
ses. A collection of match hypotheses is structurally consistent if it satisfies two criteria: 

1. Preservation: No two match hypotheses assign the same base item to multiple target items 
or any target item to multiple base items. 

2. Grounding: If a match hypothesis MH is in the collection, then so are the match hypotheses 
which pair up all of the arguments of MH's base and target items. 

The preservation criteria enforces strict one to one mappings. The grounding criteria preserves 
connected predicate structure. A collection is maximal if adding any additional match hypothesis 
would render the collection structurally inconsistent. 

Requiring structural consistency both reduces the number of possible global collections and helps 
preserve the soundness and plausibility of the candidate inferences. Without it, every collection 
of local matches would need to be considered, wasting much effort on degenerate many-to-one 
mappings without any possible inferential value. The maximality condition also reduces the number 
of Gmaps, since otherwise every subset of a Gmap could itself be a Gmap. 

The formation of global matches is composed of two primary stages: 

1. Compute consistency relationships: Here we generate for each match hypothesis the sets of 
entity mappings it entails, what match hypotheses it locally conflicts with, and which match 
hypotheses it is structurally inconsistent with. This information simplifies the detection of 
contradictory sets of match hypotheses, a critical operation in the rest of the algorithm. 

2. Merge match hypotheses: Compute Gmaps by successively combining match hypotheses as 
follows: 

(a) Form initial combinations: Combine interconnected and consistent match hypotheses 
into an initial set of Gmaps. 

(b) Combine dependent Gmaps: Since base and target dgroups are rarely isomorphic, some 
Gmaps in the initial set will overlap in ways that allow them to be merged. The advan- 
tage in merging them is that the new combination may provide structural support for 
candidate inferences. 

(c) Combine independent collections: The results of the previous step are next combined to 
form maximal consistent collections. 

Importantly, the process of Gmap construction is completely independent of Gmap evaluation. 
Which Gmaps are constructed depends solely on structural consistency. Numerical evidence is us*d 
only as a source of information to compare the relative merits of Gmaps. 

We now describe the algorithm in detail. 
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Computing consistency relationships Consistency checking is the crux of Gmap construc- 
tion. Therefore we begin by defining several *ets which represent, for each match hypothesis, which 
entity mappings it entails and which other hypotheses it is inconsistent tfith. Consider a particular 
match hypothesis M H(b i} tj) involving base item ^ and target item tj. If b i} tj are facts, then by 
the grounding criteria the match hypotheses linking their arguments must alsc be in any collection 
that MH(bi, tj) is in- Applying this criteria recursively, we see that the match hypotheses indi- 
rectly mentioned by MH(bijtj) form a tree-like graph which "bottoms out" with match hypotheses 
involving primitive entities since they have no arguments (see Figure 5). This allows us to know 
what entity pairings a particular match hypothesis depends on: 

Definition 1. An Er\ap is a match hypothesis between primitive entities. By recursive ap- 
plication of the grounding criteria, it can be seen that each match hypothesis implies a specific 
set of entity correspondences. Call the set of Emaps implied by a match hypothesis MHfatj) 
Emapa(MH{b i} tj)). EmapsfMHfc, tj)) is simply the union of the Emaps supported by MH (6< , tj) 's 
descendants. 

The preservation criteria enforces a strict one-to-one mapping. We must also associate with 
each M H(b iy tj) the set of match hypotheses which conflict with it. The most obvious conflicts are 
those match hypotheses which provide alternate mappings for fc? and tj. 

Definition 2, Given a match hypothesis Af£T(& f ,ty), the set Conflicting (MH^tj)) consists of 
the set of match hypotheses which represent the alternate mappings for 6< and tji 

Conflicting{MH(b i} tj)) = (J {MH(b k , tj) \ b h ± &,} |J 

6*€6a*e 

U {MH(b h **) | ** # *,-> 

The set Conflicting(MH(bi } tj)) only notes local inconsistencies. However, we can use it and 
Emap8(MH{bi,tj)) to define a NoGood set which contains all match hypotheses that can never be 
in the same Gmap as MH(bi,tj). 

Definition 3* The NoGood set for a match hypothesis, MH i} defines the set of match hypotheses 
which can never appear in the same Gmap as MHi . This set is the union of MHi* Conflicting set 
with the NoGood sets for all of its descendents. If MHi is an Emap, then the NoGood set collapses 
into equalling its Conflicting set. 

NoGood(MHi) = Conflicting(MHi) U ' \J NoGood(MHj) 

MH]€Arg»(MH t ) 

The algorithm computes Conflicting, Emaps, and NoGood sets as follows. First, Conflicting 
is computed for each match hypothesis, since it requires only local information. Second, Emaps 
and NoGood are computed for each Emap. Third, Emaps and NoGood sets are computed for all 
other match hypotheses by propagating the results from Emaps upwards to those match hypotheses 
which mention them. 

Two observations should be made about this computation. First, these operations can be 
efficiently implemented via bit vectors. For example, SME assigns * unique bit position to each 
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0 - MH b«tweea predicates 
A - MH between entities (Enup) 

Figure 5: Water Flow - Heat Flow analogy after computation of Conflicting relationships. Simple 
lines show the tree-like graph that the grounding criteria impojes upon matcb hypotheses. Lines 
with circular endpointa indicate the Conflicting relationships between matches. Some of the original 
lines from MH construction have been left in to show the source of a few Conflicting relations. 



match hypothesis, and carries out union and intersection operations by using QR and AND. Second, 
it is important to look for justification holes in the match hypothesis trees — match hypotheses 
whose arguments fail to match. Such match hypotheses will always fail the grounding criteria, and 
hence should be removed. For example, if one of the PRESSURE - TEMPERATURE match hypotheses 
had not been formed (see Figure 4), then the match between their governing GREATER predicates 
would be removed. Notice that removing justification holes eliminates many blatantly incorrect 
matches, such as trying to place an eighth-order IMPLIES in correspondence with a second-order 
IMPLIES. 

The next step in Gmap construction is to identify these match hypotheses which are internally 
inconsistent, and thus cannot be part of any Gmap. 

Definition 4. A match hypothesis is Inconsistent if the Emaps of one subtree of its descendants 
conflicts with the Emaps entailed by another subtree of its descendants: 

Inconsistent(MHi) iff Emaps(MHi) n NoGood(MHi) £ 0 

With the above information, SME can easily identify what combinations of match hypotheses are 
inconsistent. Furthermore, some match hypotheses cam be identified as internally inconsistent, since 
it is possible for the descendants of a match hypothesis to have mutually incompatible bindings. 
Global match construction then proceeds by collecting sets of consistent match hypotheses. The 
maximality requirement on Gmaps suggests organizing the computation top-down, rather than 
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bottom-up. Call a match hypothesis which is not the descendant of any other match hypothesis a 
root If a root is consistent, then the entire structure under it is consistent and may form an initial 
Gmap. However, base and target d groups are rarely isomorphic, so several merge steps are also 
required. The next sections describe the procedure in detail. 

Merge Step 1: Form initial combinations. The first step is to combine interconnected and 
consistent structures (Figure 6a). In the simplest case, the entire collection of descendants of 
a root may be collected together to form a globally consistent match. However, if the root is 
not consistent, then the same procedure is applied recursively to each descendant (i.e., each 
immediate descendant is now considered as a root). This effectively removes inconsistent 
match hypotheses from further consideration. The resulting set will be called Gmaps\. The 
procedure is: 

1. Let Gmapsi = 0. 

2. For every root Af-ff(6<,ty) 

(a) If ~*Incon8istent(MH(bijtj)) } then create a Gmap GM such that 
Element$(GM) = Trcc(M5(6 t -,ty)). 

(b) If InconBistcnt^Hfiijij)), then recurse on Offspring(MH(bi } tj)). 

3. For every GM 6 Gmap*i, 

(a) NoGood(GM) = U M H(6^ty)€^ot,(^^^( M (^^i)) 

(b) Emap${GM) = U w( a,^j 6 a^«f m ^( i ' ff 

At this stage inconsistent match hypotheses have been completely .eliminated. If the base and 
target Dgroups had only a single root, the next two stages would have no effect. However, things 
are rarely that simple. Typically, elements of Gmaps\ that are consistent with one another must 
be merged to provide maximality. Consistency between two GMaps is defined as: 

r* • * \jt , . »v \ Elcmtnts{GMapi) n NoGood{GMapj) = 0 

Conststent(GMa Pi UGMa Pi ) tff A N ^( GMQIH) l n E lernents(GMa!>-) = $ 

Merge Step 2: Combine dependent but unconnected Gmaps. Because the target may lack 
some of the higher-order relations that exist in the base, two or more Gmaps from Gmap8\ 
may be part of the same base structure. This step forms Gmapsi by merging all members of 
Gmaps\ that share common base structure and that are consistent (Figure 6b). This puts all 
match hypotheses with shared base structure together. When candidate inferences are added 
to the Gmap, they will fill in the missing base structure. 

Merge Step 3: Combine independent collections. Any two elements olGmapst which have 
overlapping base structure cannot consistently be merged, since if they could he then the 
previous step would have merged them. However, elements of Gmapst which do not con- 
tain overlapping structure can be consistently merged, since they are independent from the 
perspective of structural consistency. This final step generates all consistent combinations of 
Gmaps from Gmapst by successive unions, keeping only those combinations that are maximal 
(Figure 6c). 
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W (b) (c) 

l] - MH between predicates 

A - MH between entities (Emap) 

Figure 6: GMap Construction, (a) Merge step 1: Interconnected and consistent, (b) Merge step 
2: Consistent members of the same base structure, (c) Merge step 3: Any fu-ther consistent 
combinations. 



Example: Simple analogy between heat and water Figure 6 shows how the Gmaps are 
formed for the simple water flow / heat flow example. Recall that so far all we have is a large 
collection of match hypotheses, each representing a local pairing of an item from the base and an 
item from the target which could be part of a larger match. After merge step 1, only isolated 
collections exist, Merge step 2 combines the PRESSURE to TEMPERATURE mapping with the FLOW 
mapping. Finally, merge step 3 combines the isolated water and coffee attributes (see also Fig- 
ure 7). Notice that the FLOW mapping is structurally consistent with the DIAMETER to TEMPERATURE 
mapping. However, because merge step 2 placed the FLOW mapping into the same Gmap as the 
PRESSURE to TEMPERATURE mapping, merge step 3 was unable to combine the FLOW mapping with 
the DIAMETER to TEMPERATURE Gmap. 

3*2.3 Step 3: Compute Candidate Inferences 

Each Gmap represents a set of correspondences that can serve as an interpretation of the match. 
For new knowledge to be generated about the target, there must be information from the base which 
can be "carried over* into the target. Not just any information can be carried over — it must be 
consistent with the substitutions imposed by the Gmap, and it must be structurally grounded in 
the Gmap. By structural grounding, we mean that its subexpressions must at some point intersect 
the base information belonging to the Gmap. Such structures form the candidate inferences of a 
Gmap. 

Recall that Structure-Mapping does not guarantee that any candidate inference is valid. Each 
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candidate inference is only a surmise, which must be tested by other means. SME does guarantee that 
candidate inferences are structurally consistent and grounded. Aside from some simple structural 
consistency tests, mentioned below, SME uses no other rules of inference, or any other source of 
knowledge outside the language description and the base and target dgroups to evaluate a match. 
By theoretical assumption, such validity checking is the province of other modules which use SME's 
output. 5 

Tb compute ttie candidate inferences for a Gmap GAf, SME begins by examining each root 
Bz in the base Dgroup to see if it is an ancestor of any base facts in the Gmap. If it is, then any 
elements in Descendants(Bz) which are not in BaseItems(G Af) are included in the set of candidate 
inferences. A weak consistency check is performed by verifying that new, noncommutative facts are 
not commuted versions of existing facts. For example, if (GREATER (MASS sun) (MASS planet)) 
existed in the target, (GREATER (MASS planet) (MASS sun)) would be an inconsistent candidate 
inference. 

The candidate inferences often include entities. Whenever possible, SME replaces all occurrences 
of base entities with their corresponding target entities. Sometimes, however, there will be base 
entities that have no corresponding target entity; i.e., the base entity is not part of any match 
hypothesis for that Gmap. What SME does depends on the typ* of entity. If the base entity is a 
constant, such as zero, it is brought directly into the target unchanged. Otherwise, SME introduces 
a new, hypothetical entity into the target which is represented as a skolem function of the original 
base entity. Such entities are represented as (*skolea* base-entity). 

Example: Simple analogy between heat and water We return now to our extended exam- 
ple. In Figure 7, Gmap #1 has the top level CAUSE predicate as its sole candidate inference. In 
other words, this Gmap suggests that the cause of the flow in the heat dgroup is the difference in 
temperatures. 

Suppose the FLOW predicate was missing in the target Dgroup. Then the candidate inferences 
for a Gmap corresponding to the pressure inequality would be both CAUSE and FLOW, as well as 
conjectured target entities corresponding to water (heat) and pipe (bar). The two skolemized 
entities would be required because the match for the predicate FLOW is what provides a match from 
water and pipe to heat and bar. Note that GREATER- THAN [DIAMETER(cottee) , DIAMETER(ice 
cube)] is not a valid candidate inference for the first Gmap because it does not intersect the 
existing Gmap structure. 

3.2.4 Step 4: Compute Structural Evaluation Scores 

Typically a particular pair of base and target will give rise to several Gmaps, each representing a 
different interpretation of the match. Often it is desired to select only a single Gmap, for example 
to represent the best interpretation of an analogy. Many of these evaluation criteria (including 
validity, usefulness, and so forth) lie outside the province of Structure-Mapping, and rely heavily 
on the domain and application. However, one important component of evaluation is structural — 
for example, one Gmap may be considered a better analogy than another if it embodies a more 
systematic match. SME provides a programmable mechanism for computing a structural evaluation 
score (SES) for each Gmap. This score can be used to rank-order the Gmaps in selecting the "best" 
analogy, or as a factor in a more complex (but external) evaluation procedure. 

& One such module is described in (13,14). 
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Rule Filt: literal-similarity, rules Humber of Match Hypotheses: 14 

Goap #1: { (>PRESSUR£ > TEMPERATURE) (PRESSURE-BEAKER TEMP-COFFEE) 
(PRESSURE-VIAL TEMP- ICE-CUBE) (VFLOW HFLOV) > 
Emaps: { Cbeaker coffee) (vial ice-cube) (vater heat) (pipe bar) } 
Weight: 5.09 

Candidate Inferences: (CAUSE >TEMPERATUPE HFLOW) 

Gmap #2: { (>DIAMETER > TEMPERATURE) (DIAMETER -i TEMP-COFFEE) 
(DIAME7ER-2 TEMP-ICE-CUBE) > 
Eaaps: { (beaker coffee) (vial ice-cube) } 
Weight: 3.94 
Candidate Inferences: { > 

Gmap #3: { (LIQUID-3 LIQUID-6) (PLAT-TOP-4 FLAT-TOP-6; > 
Enape: { (water coffee) > 
Weight: 2.44 
Candidate Inferences: { } 

Figure 7: Complete SME interpretation of Water Flow - Heat Flow Analogy. 



The structural evaluation score is computed in two phases. First, each match hypothesis is 
assigned some lozal degree of evidence, independently of what Gmaps it belongs to. Second, 
the score for each Gmap is computed based on the evidence for its match hypotheses and the 
structural properties of the Gmap itself (such as the number and kind of candidate inferences). We 
will first introduce the general e/idence processing architecture, followed by a description of how 
the structural evaluation scores are computed. 

The scoring of matches is programmable: evidence for or against each aspect of the match 
is found by running match evidence rules and combining their results. These rules are similar 
syntactically to the match constructor ruies and provide evidence scores for a match in the form of 
a probabilistic weight ranging between 0 and 1. VT sing rules to provide evidence greatly increases 
SME's programmability. T mportantly, * ese evidence scores do not rely on any probabilistic or 
evidential information about the base r target per se. As we describe below, we currently are 
using the Dempster-Shafer formalism. However, our algorithms are independent of the details of 
Dempster-Shafer, and should work with any formalism for combining evidence. 

The management <tf evidence rules is performed by a Belief Maintenance System (BMS) [12]. 
A BMS is a form of Truth-Maintenance system, extended to handle numerical weights for evidence 
and degree of belief. In SME, a BMS node b associated with every match hypothesis and Gmap. 
BMS justifications are Horn clause*, annotated with evidential weight . If the system knows that 
belief in itemj is dependent in some way upon belief .in item 2 , then it automatically modifies belief 
in itemi whenever new information causes a change in belief of itemj. In the limiting case of 
evidential weights of only 1 and 0 (i.e., true and false), the BMS behavior reduces to that of a 
standard justification-based TIvIS. 

The current BMS implementation uaea the Dempster-Shafer formalism fo» belief and combines 
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evidence with a simplified form of Dempster's rule of combination [54,48,33,12]. Using a modified 
form of Shirr's representation, we express the belief in some proposition by the pair (s(i.) , s(-> 
A)), whe. a (A) represents the current amount of support for A and s(-> A) is the current support 
against A. Dempster's rule provides a means for combining evidence Nased upon different sources 
of informatics. Thus, given that Belief (A)»(0. 4, 0) and Belief (B)«(0. 6, 0), together with 
(IMPLIES A C)( 0 j t0 ) and (IMPLIES B C ) ( 1(0 ), Dempster's rule provides a belief in C equal to (0.728, 
0.0). 

Pattern-directed rules are provided that trigger on certain events in the knowledge base (e.g., 
[45,6]). The rules are of the form: 

(rule ((NcatcdTriggcra)) {Body)) 
where a nested-trigger is of the form 

((Trigger) {Pattern) [:test (TestForm)! ) 

For example, the rule 

(rule ((: intern (bird ?x))) 

(asaertl (implies (bird ?x) (flies ?x) (0.90 . 0.05)))) 

causes the implication "if Fido is a bird, then there is a 90 to 95% probability that Fido can fly* to 
be asserted when (bird Fido) first appears in the knowledge base (whether it is believed or not). 
Notice that the rule system automatically converts the implies statement into a Horn clause; such 
translations will automatically be performed on all compositions of implies, and, or, and not. 
We now describe the two phases for computing structural evaluation scores. 

Assigning local evidence The local evidence for each match hypothesis is found by running 
the match evidence rules and combining their results. The rules trigger on various properties 
of a match hypothesis and provide evidence for or against it based upon those properties. In 
addition to providing direct evidence for a match hypothesis, the rules can also assert relationships 
between evidence for different hypotheses. For example, the systematicity preference for analogy is 
implemented by a rule that propagates belief in a match hypothesis to its offspring, thus increasing 
the evidence for Emaps that provide structural ground for a large systematic structure. All evidence 
is combined by asserting such relationships. 

Let us consider some analogy evidence rules for concreteness. A match hypothesis involving 
two facts looks plausible if the predicates are the same, if the facts are of similar order in the base 
and target dgroups, and if their arguments can potentially match. A match hypothesis involving 
two facts appears implausible if the predicates are relations and their names are different or if the 
difference in their order is greater than one. For modularity, we separate each criteria into distinct 
rules. For example, 

( a 3sert ! 1 same - f tine tor) 

(rule ((.intern (MH ?b ?t) :test (and (fact? ?b) (fact? ?t) 

(eq (fact-functor ?b) 

(fact-functor ?t))))) 
(assert! (implies same-functor (MH ?b ?t) (0.5 . 0.0)))) 
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states that "if the base item and target item are facts with the same functors, then supply 0.5 
evidence in favor of the match hypothesis." A strong preference for systematicity is expressed by 
the local evidence rulo: 

(rule ((rintern (MH ?bl ?tl)) 

(rintern (MH ?b2 ?t2) :test (ctoldren-of ? ?b2 ?t2 ?bl ?ti))) 
(assert! (implies (MH ?bl ?tl) (MH ?b2 ?t2) (0.8 . 0.0)))) 

which propagates 80% of a match hypothesis' belief to its offspring. As a result, the more matched 
structure that exists above a given match hypothesis, the more that hypothesis will be believed. 
Thus this "trickle down" effect provides a local encoding of the systematicity principle. 

Computing the global Structural Evaluation Score Clearly an important factor in a Gmap's 
score is the evidence for the match hypotheses which participate in it. However, there are a num- 
ber of other factors that are potentially relevant as well. Such factors include the number and 
size of connected components, the existence and structure of the candidate inferences, and other 
graph-theoretic properties of the Gmap. 

We suspect that different factors will be relevant for different applications, and for modeling the 
different "analogical styles" of human subjects. Consequently, evidence rules are used to compute 
the evidence for Gmaps as well, to provide maximum flexibility. In this paper, we use only the 
following evidence rule for Gmaps: 

(rule ((rintern (GMAP ?gmap))) 

(delist (?mh (gmap-elements ?gmap)) 

(assert! (implies ?jnh (GMAP ?gmap))))) 

which states that the belief of each match hypothesis is added to the belief of the Gmaps it is a 
member of. We have found this simple rule to be sufficient for all of the examples encountered 
so far, and so have not yet addressed the issue of evidence due to candidate inferences or graph 
theoretic structure. 

Although the evidence for match hypotheses is constrained to be between 0 and 1, we do not 
normalize the evidence for Gmaps in the same way. Instead, we simply take as evidence for the 
Gmap the sum of the evidence for its match hypotheses 6 . Originally we combined evidence for 
Gmaps according to Dempster's rule, so that the sum of beliefs for all the Gmaps equal 1 [15]. 
We discovered two problems with this scheme. First, Dempster's rule is particularly susceptible to 
roundoff problems (i.e., unstable). Second, normalizing Gmap evidence prevents us from comparing 
matches using different base domains (as one would want to do for access experiments), since the 
score would be a function of the other Gmaps for a particular base and target pair. Under the 
current scheme, the evidence score can be used to compare matches of different base descriptions 
with the same target domain. However, it still cannot be used to compare two completely different 
analogies (i.e., different base, different target). 7 

c The BMS allows one to declare certain forms, such as (GMAP ?gnap), for special treatment. Thus, while the 
syntax of the Gmap evidence rule looks the same as the MH evidence rules (i.e., the use of IMPLIES), evidence is 
combined by addition rather than Dempster's rule. 

7 One of our current research goals is the construction of a structural evaluator that would produce scores cor- 
responding to a single, fixed scale. With this evaluator, SME would then be able to rate two completely different 
similarity matches as being equally good, regardless of how different their domain descriptions were in size. 
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The BMS justifications provide a valuable tool for exploring the consequences of evidence rules. 
By using interrogatives similar to those in truth-maintenance systems, one can get a clear picture 
of how the evidence rules are combining to produce the observed results. For example, the belief of 
(0.712, 0.0) in the match hypothesis between PRECSURE and TEMPERATURE (Figure 4) is explained 
by the BMS as follows: 

(MH PRESSURE-BEAKER TEMP-COFFEE) has evidence (0.712, 0.0) due to 
IMPLICATION (MH >PRESSURE >TEMPERATURE) ) (0 . 52 , 0 . 0) 
IMPLICATION (CHILDREN-POTENTIAL) (0.4, 0.0) 

V/e suspect that the ability to "tune" the criteria for choosing a Gmap will be important for 
modeling individual differences in analogical style and a subject's domain knowledge. For example, 
a conservative strategy might favor taking Gmaps with some candidate inferences but not too many, 
in order to maximize the probability of being correct. 

Although the match and evidence rules are programmable, it is important to note that all of 
the examples given in this paper use the same set of analogy rules, unless otherwise specified. In 
addition, the only other rules we have ever used are the literal similarity and mere-appearance rule 
sets. No "example-dependent" modifications have been performed to produce better results on any 
particular example. 

Example: Simple analogy between heat and water Returning to Figure 7, note that the 
"strongest 5 ' interpretation (i.e., the one which has the highest structural evaluation score) is the 
one we would intuitively expect. In other words, beaker maps to coliee, vial maps to ice-cube, 
water maps to heat, pipe maps to bar, and PRESSURE maps to TEMPERATURE. Furthermore, we 
have the candidate inference that the temperature difference is what causes the flow. 

3*3 Analysis 

Here we review the SME algorithm and analyze its complexity. The algorithm is not straightforward, 
and depends critically on the particular data and match rules it is given. Consequently, we focus on 
identifying best-case and worst-case bounds as well as the factors which critically affect performance. 

The algorithm is summarized in Figure 8. Referring back to our extended water flow / heat flow 
example, SME first hypothesized local matches between individual facts and entities. The resulting 
matches, shown in Figure 4, were then combined to form Gmaps. The three-step merging process, 
shown in Figure 6, produced the set of global mappings shown in Figure 7. The set of candidate 
inferences and a structural evaluation score was also created for each Gmap. 

We use the following notation in the complexity analysis: 

Sb = Number of entities in the base dgroup. 
St = Number of entities in the target dgroup. 
7b — Number of facts in the base dgroup. 
Tt s Number of facts in the target dgroup. 
At = Number of match hypotheses. 
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• Run MHC rales to construct match hypotheses. 

• Calculate the Conflicting set for each match hypothesis. 

• Calculate the EMaps and NoGood sets for each match hypothesis by upward 
propagation from entity mappings. 

• During the propagation, delete any match hypotheses that have justification holes. 

• Merge match hypotheses into Gmaps. 

1. Interconnected and consistent. 

2. Consistent members of same base structure. 

3. Any further consistent combinations. 

• Calculate the candidate inferences for each GMap. 

• Score the matches 

1. Local match scores. 

2. Global structural evaluation scores. 



3.3.1 Analysis of local match construction 

The number of match rules is small, and hence largely irrelevant. The : filter rules are run for 
each pair of base and target predicates. Consequently, they will always require 0(Nt>* Nt). The 
: intern rules are run once on each match hypothesis. In the worst case, M = Nt> * Nt, or roughly 
iV 2 . But in practice, the actual number of match hypotheses is substantially less, usually on the 
order of cN y where c is less than 5 and N is the average of and iV t . Thus, in practice, : intern 
rules have a run time of approximately 0(N). 

3.3.2 Analysis of Conflicting calculation 

Recall that SME assigns a Conflicting set to each match hypothesis, MH(b u t 3 ) which represents 
the alternate mappings for b % and tj. The conflicting sets are calculated by examining the match 
hypotheses each base item appears in and the match hypotheses each target item appears in. Let C 
be the average number of alternative matches each item in the base and target appears in. SME loops 
through the C match hypotheses twice: once to form the bitwise union of these match hypotheses 



Figure 8: Summary of SME algorithm. 
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and once to update each hypotheses' Conflicting set. *~hus, the entire number of operations is 

ft * 2C) + (£ h * 2C) + (? t * 2C) + {£ t * 2C) 

The worst case is when a match hypothesis is created between every base and target item. If 
we also assume JV* = JV*, then C = in that case. The number of operations becomes 4N t 2 
or approximately 0(N 2 ). Conversely, the best case performance occurs when C is 1, producing 
0(maz(Nh,Nt)) operations. In our experiments so far, we find that C is typically quite small, and 
so far has always been less than 10. Consequently, the typical performance lies between 0(N) and 
0(N 2 ). 

3.3.3 Analysis of EMaps and NoGood calculation 

Recall that once the Conflicting sets are calculated, the Emaps and NoGood sets are propagated 
upwards from the entity mappings through the match hypotheses. By caching which AfjJ(& t ,£y)'s 
correspond to Emaps and using a queue, we only operate on each node once. Hence the worst and 
best case performance of this operation is 0(M). 

3.3.4 Analysis of Gmap merge steps 

Global matches are formed in a three step process. The first step is to collect all of the consistent 
connected components of match hypotheses. This requires starting down the match hypothesis 
toots, walking downwards to find consistent structures. Each tree walk takes 0(Ni), where JV,- is 
the number of nodes in the current tree. If there are Nr roots, then the first merge step takes 
0{Nr * Ni). Assuming that most of the match hypotheses will appear in only one or two trees 
(some roots may share substructure), we can approximate this by saying that the first merge step 
is 0(M). Call the number of partial Gmaps formed at this stage Qp\. 

The complexity of the previous steps has been, perhaps surprisingly, small. Matching computa- 
tions usually have much worse performance, and we cannot completely escape this. In particular, 
a worst-case upper-bound Zcr the second and third merge steps is 0(N\) (although worst-case for 
one implies best-case for the other). 

The second merge step simply forms new Gmaps by combining those Gmaps from step one that 
intersect the same base structure. This is done by looping through each base description root, and 
checking which GMaps intersect the structure under that root. For each base description root, all 
possible consistent, maximal combinations of the Gmaps that intersect it are generated. In the 
worst case, every Gmap could intersect the same base structure. This would mean generating all 
possible consistent, maximal sets of Gmaps, which is the operation performed in step 3. We deter 
analysis of this case until then. In the other extreme, none of the Gmaps share a common base 
structure, and so step 2 requires 0($pi 2 ) operations, although this is not the best-case performance 
(see below). Typically, the second merge step is very quick and displays near best-case performance. 

The final set of Gmaps is formed, by generating all consistent combinations of Gmaps that 
exist after merge step 2. The complexity of this final merge step is directly related to the degree 
of "stracturalness" of the base and target domains and how many different predicates are in use. 
Worst-case performance occurs when the description language is flat (i.e., no higher-order structure) 
and the same predicate occurs many times in both the base and the target. Consider a language 
with a single, unary predicate, and base and target dgroups each consisting of indistinct facts. 
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In this case every base fact can match with every target fact, and will suggest matching in turn 
the entities that serve as their arguments. This reduces to the problem of finding all isomorphic 
mappings between two equal size sets. There are tf! such mappings. 

Now let us consider the best case* If the base and target dgroups give rise to a forest that has 
but one root, and that root is consistent, then there is only one Gmap! The second and third merge 
steps in this case are now independent of JV, and are constant-time. 

Of course, the typical case is somewhere between these two extremes. Typically the vocabulary 
of predicates is large, and the relationships between entities diverse. Structure provides a strong 
restriction on the number of possible interpretations for an analogy. By the time SHE gets to this 
phase of the generation process, many of the match hypotheses have been filtered out as being 
structurally impossible. Others have been permanently bound to entire sets of match hypotheses 
due to merge steps 1 and 2. In addition, our match rules require that relational predicates only 
match other predicates with the same name. Thus, SHE will perform badly on large dgroups with 
no structure. However, SHE will perform extremely well on large dgroups with deep networks of 
diverse higher-order relationships* Semantically, the former case roughly corresponds to a jumble 
of unconnected facts, and the latter case to a complex argument or theory. The better organized 
and justified the knowledge, the better SHE will perform. 

While we reported potential 0(N\) performance for the second merge step, our experience 
shows that this is one of the fastest sections of the algorithm. We suspect that 0(N\) behavior 
for this step may only be possible in theory and would never show up in practice. This is because 
worst-case performance occurs when all members of Gmapsi intersect the same base-structure and 
so must be merged in all possible ways (as in step 3). However, Gmaps intersecting the same base 
structure are almost always consistent with one another, mewing that step 2 would usually merge 
Gmapsi into one Gmap in O(Qpi) time! 

3*3*5 Analysis of Finding Candidate Inferences 

The candidate inferences are gathered by looping through the base description roots for each Gmap, 
collecting missing base facts whenever their structure intersects a match hypothesis in the Gmap. 
Each fact is tested to ensure that it is not already matched with part of the target description, or if 
it represents a contradiction of an existing target fact. The size of the typical candidate inference 
is inversely related to the number of roots: More roots implies less structure to infer, and vice 
versa. Thus in the worst case we have 0{$ * 7b * ?t)> or roughly 0(N A ). However, this is an 
extreme worst-case. First, the ft term implies that we check eve.7 target fact on each iteration. 
The algorithm actually only checks the pertinent target facts (i.e., those with the same functor), 
giving a more realistic worst-case of 0(N Z ). In the best case, there will only be one Gmap and no 
candidate inferences, producing constant time behavior. 

3.3.6 Analysis of Structural Evaluation Score computation 

The complexity of the belief maintenance system is difficult to ascertain. However, it is more or 
less irrelevant, since it is possible to eliminate it if detailed justifications of evidential results are 
not needed. In the prototype version of SHE [15], specialized evidence evaluation procedures were 
used which had most of the flexibility of the evidence rules, yet ran in 0(M) time. 

While the flexibility the BMS provides is valuable, we note that in fact the majority of SME's 
processing time takes place within it - typically 70 to 80%. However, in terms of real-time this has 
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not yet been a serious limitation. On the examples in this paper (and most of the examples we 
have examined), SHE runs in a matter of a few seconds on a Symbolics machine. 

4 Examples 

The Structure-Mapping Engine has been applied to over 40 analogies, drawn from a variety of 
domains and tasks. It is being used in psychological studies, comparing human responses with 
those of SHE for both short stories and metaphors. It is also serving as a module in a machine 
learning program called PHINEAS, which uses analogy to discover and refine qualitative models of 
physical processes such as water flow and heat flow. Here we discuss a few examples to demonstrate 
SME's flexibility and generality. 

4.1 Methodological constraints 

Flexibility is a two-edged sword. The danger in using a program like SME is that one could imagine 
tailoring the match construction and evidence rules for each new example. Little would be learned 
by using the program in this way — we would have at best a series of "wind-up toys" , a collection 
of ad-hoc programs which shed little theoretical light. Here we describe our techniques for avoiding 
tailorability. 

First, all the cognitive simulation experiments were run with a fixed collection of rule sets 
(three sets to be exact). Each rule set represented a particular type of comparison sanctioned 
by the Structure-Mapping theory (analogy, literal similarity, and mere appearance). As described 
above, these rule sets are listed in Appendix A. The mere appearance rule s (MA) care only about low- 
order matches: attributes and first-order relations. The analogy rules (TA) give strong preference 
to systems of relations and higher-order relations, while the literal similarity rules (LS) measure 
overall similarity. The first two examples in this section use the TA rules, while the last uses both 
TA and MA rules, as indicated. 

While the choice of match construction rules is fixed by Structure-Mapping, the values of 
evidence weights are not. Although we have not performed a sensitivity analysis, in our preliminary 
explorations it appears that the Gmap rankings are not overly sensitive to the particular values of 
evidence weights. (Recall that which Gmaps are constructed is independent of the weights, and is 
determined only by the construction rules and structural consistency.) 

Second, we have accumulated a standard description vocabulary which is used in all experiments. 
This is particularly important when encoding natural language stories, where the translation into a 
formal representation is underconstrained. By accumulating representation choices across stories, 
we attempt to free ourselves from "tailoring" particular examples. 

Third, we have tested SHE with descriptions generated automatically by other AI programs. A 
representation developed to perform useful inferences has fewer arbitrary choices than a represen- 
tation developed specifically for learning programs. So far, we have used descriptions generated 
by two different qualitative simulation programs. The results are encouraging. For example, SME 
actually performs better on a water-flow / heat-flow comparison using more complex descriptions 
generated by GIZMO [19] than on many hand-generated descriptions. 8 We are working on other, 
similar systems, as described in Section 6.3.1. 

*In fact, GIZMO stopped working before SME was built. 
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RUTHERFORD-ATOM 



Force Att ractsfsun, planet) Greater 





Revolve(electron, nucleus) 
Attracts (nucleus, electron) 



Figure 9: Solar System - Rutherford Atom Analogy. 



4.2 Solar System - Rutherford Atom Analogy 

The Rutherford model of the hydrogen atom was a classic use of analogy in science. A target system 
that was relatively unknown was explained in terms of the (relatively) well-understood behavior of 
the solar system. We illustrate SME's operation on this example with a simplified representation, 
shown in Figure 9. 

The Structure-Mapping Engine constructed three possible interpretations. The highest-ranked 
mapping (SES = 6.03) pairs up the nucleus with the sun and the planet with the electron. This 
mapping is based on the mass inequality in the solar system playing the same role as the mass 
inequality in the atom* It sanctions the inference that the differences in masses, together with 
the mutual attraction of the nucleus and the electron, causes the electron to revolve around the 
nucleus. This is the standard interpretation of this analogy. 

The other major Gmap (SES = 4.04) has the same entity correspondences, but maps the 
temperature difference between the sun and the planets onto the mass difference between the 
nucleus and the electron. The SES for this Gmap is low for two reasons. First, temperature and 
mass are different functions, and hence they receive less local evidence. The second reason is that, 
unlike the first Gmap, there is no systematic structure associated with temperature in the base 
dgroup, and thus other relations such as the match for ATTRACTS do not enter into this Gmap. 

The third Gmap is a spurious collection of match hypotheses which imply that the mass of the 
sun should correspond to the mass of the electron, and the mass of the planet should correspond 
to the mass of the nucleus. There is even less higher-level structure to support this interpretation 
and so SES = 1.87. 

This example demonstrates an important aspect of the Structure-Mapping account of analogy. 
Notice that the interpretation preferred on structural grounds is also the one with the most infer- 
ential import. This is not an accident; the systematicity principle captures the structural features 
of well-supported arguments. SME prefers interpretations based on a deep theory (i.e., a subset of a 
dgroup containing a system of higher-order relations) to those based on shallow associations (i.e., 
a subset of a dgroup with an assortment of miscellaneous facts). 



34 



The Structure-Mapping Engine 



29 



Water Flow History 

(Situation SO) 

(Decreasing (Pressure (At beaker SO))) 
(Increasing (Pressure (At vial SO))) 
(Decreasing (Amount-of (At beaker SO))) 
(Increasing (Auount-of (At vial SO))) 
(Greater (Pressure (At beaker SO)) 
(Pressure (At vial SO))) 

(Situation SI) 
(Meets SO SI) 

(Constant (Pressure (At beaker SI))) 
(Constant (Pressure (At vial SI))) 
(Constant (Amount -of (At beaker SI))) 
(Constant (Amount-of (At vial SI))) 
(Equal-To (Pressure (At beaker SI)) 
(Pressure (At vial SI))) 

(Function-Of (Pressure ?x) 

(Amount-of ?x)) 



Heat Flow History 

(Situation SO) 

(Decreasing (Temp (At horse-shoe SO))) 
(Increasing (Temp (At water SO))) 
(Greater (Temp (At horse-shoe SO)) 
(Temp (At water SO))) 



(Situation SI) 
(Meets SO SI) 

(Constant (Temp (At horse-shoe SI))) 
(Constant (Temp (At water SI))) 
(Equal-To (Temp (At horse-shoe SI)) 
(Temp (At water SI))) 



(Function-Of (Temp ?x) 
(Heat ?x)) 



Match 



Pressure +-+ 
Amount-of 

50 ~ 

51 ~ 
beaker «-+ 

vial 



Temperature 

Heat 

SO 

SI 

horse-shoe 
water 



Figure 10: Analogical match between water flow history and heat flow history. 



4.3 Discovering Heat Flaw 

The PHINEAS system is a program which learns by acting as a passive observer, relating ob- 
served physical phenomena to known theories of the world [13,14]. These theories are expressed 
as qualitative models of various physical processes, such as motion, boiling, and liquid flow, using 
Forbus' Qualitative Process Theory [17,18]. 

When PHINEAS is presented with a situation which the program's current models cannot explain, 
an analogical learning module is invoked which attempts to generate a new or revised model that 
can account for the new observation. This module u c « SME in two ways. First, once an analogous 
experience has been accessed (i.e., one which exhibits similar behavior), SME is used to form a match 
between the changes observed in the prior experience and the changes taking place in the current 
situation. After these correspondences have been established, the domain models used to explain 
the previous experience are fetched and SME is used a second time to construct a new model for the 
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Figure 11: Qualitative Process Theory model of liquid flow. 



current domain. 

For example, suppose that the program was presented with measurements of the heat flow 
situation in Figure 1 and described in Figure 10. If the domain model does not include a theory of 
heat flow, PHINEAS will be unable to interpret the new observation. 9 Using SME, the program is able 
to establish an analogy with the previously encountered water flow experience shown in Figure 1 
(see also Figure 10). This match serves to establish which properties from the two situations are 
behaving in the same way. As shown in Figure 10, the roles of the beaker and the vial in the 
water flow history are found to correspond to the roles of the horse shoe and water in the heat flow 
history, respectively. Those correspondences which provide a mapping between entities or between 
their quantities (e.g., Pressure and Temperature) are stored for later reference. 

When it is satisfied that the chosen water flow history is sufficiently analogous to the current 
situation, PHINEAS fetches the relevant domain theory which led to its prior understanding of the 
base (water flow) experience. Figure 11 shows a domain description for water flow which has been 
used in For bus' qualitative reasoning program [19,22]. This model (which is considerably more 
detailed than the liquid flow model shown previously) states that if we have an aligned fluid path 
between the beaker and the vial (i.e., the path either has no valves or if it does, they are all open), 
and the pressure in the beaker is greater than the pressure in the vial, then a fluid flow process 

g PHIHEXS uses the ATM! theory of measurement interpretation to explain observations. See [20) for details. 
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Gm&p #1: { (AM0UNT-0F-36 HEAT-VATE P> (AH0UNT-0F-33 HEAT-HSHOE) 

(PRESSURE-BEAKER TEMP-HSHOE) (PRESSURE-VIAL TEMP-WATER) } 
Eamps: { (beaker horse-shoe) (vial water) } 
Weight: 2 .676 

Candidate Inferences: (IMPLIES 

(AND (ALIGNED (*«kolem* pipe)) 

(GREATER-THAN (A TEMP-HSHOE) (A TEMP-WATER))) 

(AND (Q- (FLOW-RATE pi) (- TEMP-HSHOE TEMP-WATER)) 
(GREATER-THAN (A (FLOW-RATE pi)) zero) 
(1+ HEAT-WATER (A (FLOW-RATE pi))) 
(I- HEAT-HSHOE (A (FLOW-RATE pi))))) 

Figure 12: An Analogically Inferred Model of Heat Flow. 



will be active. This process has a flow rate which is proportional to the difference between the two 
pressures. The flow rate has a positive influence on the amount of water in the vial and a negative 
influence on the amount of water in the beaker. 

Using SME a second time, this theory is matched to the current heat flow situation, producing 
the output shown in Figure 12. The analogy at this stage is highly constrained, due to the set of 
entity and function correspondences established when the water flow and heat flow histories were 
matched. SME's rule-based architecture is critical to this operation: PHINEAS simply provides a new 
set of match constructor rules that only allow hypotheses consistent with the specific entity and 
function correspondences previously established. Entities and functions left without a match after 
the accessing stage are still left free to match other unmatched entities and functions. For example, 
the rule 

(MHC-rule ( filter ?b ?t :test (sanctioned-pairing? (lact-lunctor ?b) 

(lact-lunctor ?t))) 

(install-MH ?b ?t)) 

was used to force a match between those quantities which were found to be analogous during the 
beha/ioral similarity match (e.g., PRESSURE and TEMPERATURE) and prevent any alternate matches 
for these quantities (e.g., AMOUNT-OF and TEMPERATURE). 

This example demonstrates several points. First, the "analogy" here is composed almost entirely 
of candidate inferences, since the system had no prior model cf heat flow. Hence, the model was 
constructed by analogy rather than augmented by analogy. This shows the power of SME's candidate 
inference mechanism. Second, the example illustrates how SME's rule-based architecture supports 
situations in which the entity correspondences are given prior to the match, rather than derived 
as a result of the match. Finally, it shows the utility of introducing skolemized entities into the 
candidate inferences. The results produced by SME (Figure 12) contain the entity (*skolem* pipe) . 
This indicates that, at the moment, the heat path is a conjectured entity. At this time, the system 
inspects its knowledge of paths to infer that immersion or physical contact is a likely heat path. 
However, we note that much knowledge gathering and refinement may still take place while leaving 
the heat path as a conjectured entity. An example of this strategy in the history of science is when 
ether was postulated to provide a medium for the flow of light waves. 
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Base Story 

Karla, an old Hawk, lived at the top of a tall oak tree. One afternoon, she saw a hunter on the ground with 
a bow and some crude arrows that had no feathers. The hunter took aim and shot at the hawk but missed. Karla 
knew that hunter wanted her feathers so she glided down to the hunter and offered to give him a few. The hunter 
was so grateful that he pledged never to shoot at a hawk again. He went off and shot deer instead. 

Target Story - Analogy 

Once there was a small country called Zerdia that learned to make the world's smartest computer. 

One day Zerdia was attacked by its warlike neighbor, Gagrach. But the missiles were badly aimed and the attack 
failed. The Zerdian government realised that Gagrach wanted Zerdian computers so it offered to sell some of its 
computers to the country. The government of Gagrach was very pleased. It promised never to attack Zerdia again. 

Target Story - Mere* Appearance 

Once there was an eagle named Zerdia who donated a few of her tailfeathers to a sportsman so he would promise 
never to attack eaglet. 

One day Zerdia was nesting high on a rocky cliff when she saw the sportsman coming with a crossbow. Zerdia 
flew down to meet the man, but he attacked and felled her with a single bolt. As she fluttered to the ground Zerdia 
realised that the bolt had her own tailfeathers on it. 

Figure 13: Story Set Number 5. 



4.4 Modelling Human Analogical Processing 

SME is being used in several cognitive simulation studies. Our goal is to compare human responses 
with cftose of SME's for a variety of tasks and problems. For example, two psychological studies 
[29,50] have explored the variables that determine the accessibility of a similarity matcfc and that 
determine the inferential soundness of a match. Structure-Mapping predicts that the degree of 
systematic relational overlap will determine soundness [25]. A further hypothesis suggested by 
Structure Mapping is that surface similarity will determine accessibility [29]. The psychological 
studies supported both hypotheses. In order to verify the computational assumptions we then ran 
SME on the same examples. Here we briefly summarize the methodology and the results — for 
details see [57]. 

The hypotheses were tested psychologically as follows. Pairs of short stories were constructed 
which were alike in one of the three ways: mere appearance, analogy, or literal similarity. Subjects 
first read a large set of stories. Then, in a second session, subjects saw similar stories and tried to 
retrieve the original stories (the access measure). After that, the subjects were then asked to judge 
the inferential soundness of each of the story pairs. For the cognitive simulation study, five groups 
of stories (15 in all) were encoded. Then pairs of stories were presented to SME, using different rule 
sets corresponding to analogy (the AN rules) and mere appearance (the MA rules). The results 
from the AN rules were used to estimate soundness, while the results from the MA rules were used 
to estimate accessibility. One of these story groups will be discussed in detail, showing how SME 
was used to simulate a te3t subject. 

In the story set shown in Figure 13, the original story concerned a hawk named Karla who 
withstands an attack by a hunter. Two similar, target stories were used as potential analogies for 
the Karla narration. One was designed to be truly analogous (TA5) ar i describes a small country 
named Zerdia that withstands an attack by another country. The other story (MAS) was designed 
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Analogical Match from Karl* to Zerdia the country (TA5). 
Kilo Filt: ftpptiraico-aatck.rtloo I tab*: of Natch 37potbottt: 12 Iamb at of GMaps: 1 
Gmp tl: 

(HiPPimg-HOTTP HAPPIHES3-0ACAACH) (AITACX-TOHTtt ATTACK-CACiACH) (TAJOt-rEAIHOS BUT-SUPEICOKPtTTEK) 
(VAILUX-WItn WAliUK-CAOLACH) (DS8XU-FZATHE1S DESttl-StJraCQMPUTQ) 

(lAs-itAims OTE-supnctmnx) (offeweatbbis orra-supncmanrrno (vcapoi-bov heapoi-bow) 

&Mp«: (KAftLAl ZOLDXA12) (FKATHHS3 SUPnCOMPUTOllA) (CMSS-B0H4 MISSILES IB) (HUITEI2 GACUCH13) 
Wolfkt: 0. 411672 



Analogical Match from Karia to Zerdi* the eagle (MAS), 

tnU Filo: appntramci-aatck.rtXts Inbox of Mate* CTpotaoooo: 14 loabar of GMapo: 1 
0m? #1: 

(omt-rXAXEIU OFra-FXATBOS) (TAKX-FKATHERS TAK-rtATHZtS) (ATTACX-HUrm ATTia-SPOlISHAl) 
(SOC-UIU tUX-ZXMDU) CHAS-miBOU HAS-FEATHEIS) (BI1IMCA1U IM-ZODU) (IEAP0I-B0W WEAP0I-B0W) 

(Dciu-rtATms ocmi-rzATHns) cwailhe-hohtdl wailikk-sportsmah) crasoi-Huirn peisoi-spoitshu) 

EMpo: (FIATHDL53 FEATHEtSD) (OLQ3S-B0W4 OL0SS-B0W10) (I0I7EI3 SP0ET8MAI8) QCUT.il 2XIDXA7) 
Wolgkt: 7.70ZBU 



Figure 14: SME's Analysis of Story Set 5, Using the MA Rules. 



to be only superficially similar and describes an eagle named Zerdia who is killed by a sportsman. 
The representation of the Karla story given to SME was: 

(CAUSE (EQUALS (HAPPIXESS HUHTER) HIGH) 

(PROMISE HUHTER KARLA (NOT (ATTACK HUHTER KARLA)))) 
(CAUSE (OBTAIH HUHTER FEATHERS) (EQUALS (HAPPIHESS HUHTER) HIGH)) 
(CAUSE (OFFER KARLA FEATHERS HUHTER) (OBTAIH HUHTER FEATHERS)) 
(CAUSE (REALIZE KARLA (DESIRE HUHTER FEATHERS)) (OFFER KARLA FEATHERS HUNTER)) 
(FOLLOW (EQUALS (SUCCESS (ATTACK HUHTER KARLA)) FAILED) 

(REALIZE KARLA (DESIRE HUHTER FEATHERS))) 
(CAUSE (HOT (USED-FOR FEATHERS CROSS-BOW)) (EQUALS (SUCCESS (ATTACK HUHTER KARLA)) FAILED) ) 
(FOLLOW (SEE KARLA HUHTER) (ATTACK HUHTER KARLA)) 
(WEAPON CR0SS-B0W4) 
(KARLAS- ASSET FEATHERS3) 
(WARLIKE HUHTER2) 
(PERSON HUNTERS) 
(BIRD KARLA l) 

To test the hypothesis that accessibility depends on the degree of surface match, SME was run 
on both pairs using the mere-appearance match rules. This measured their degree of superficial 
overlap and thus, according to our prediction, the relative likelihood of their accessing the original 
story. The output of SME for the MA task is given in Figure 14, which shows that the eagle story 
(SES = 7.7) has a higher mere-appearance match rating than the country story (SES = 6.4). Thus, 
if the surface-accessibility hypothesis is correct, the MA story "Zerdia the eagle" should have led 
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Analogical Match from KarJa to Zerdia the country (TA5). 

Ralo Filo: wialo^y.rtilo Iimbtr of K*tc* Hypoth*«tt: W gutbtr of OUpt: 1 
fe»o fl: 

(CAUSE-PROMISE CAUSE-PROMISE) (SUCCESS -ATTACK SUCCESS -ATT Ad) (HAPPf-HUSTOL EAPPT-GACXACH) 
(HAPPUESS-HUITKR HAPPHESS-CAGRACH) (REALIZE-DESIRE REALIZE-JESIRE) (CAUSE-TAKE CAUSE-BUT) 

(ATTAa-Huma attack-cagrach) (desire-feathers desire-supercomputer) (failed- attack failed-attaci) 

(IAKK-FIAIHEU WT-5UPXXC0>runX) (CAUSE- FAIL©- ATTACK CAUSE-FAILED- AIIACK) 

CCAUSE-orm cause-after) ( follow-real xze foixow-realize) (has-feaihers usE-supacoxpuua) 

(CAUSE-HAFPT CAUSE-HAPPT) (IOT-ATTACK JOT- ATT ACE) (PROMISE-HUITER PROMISE) 
(IQI-HAS-FEATHEIS I!~-USE- SUPERCOMPUTER) (OFFER-FEATHERS OFFER-SUPERCOMPUTER) 

Eupi: (HICH23 HI0HI7) (FXATHERS20 SUPERCOMPUTERS (CROSS-B0W21 MISSILES1S) 
(HUITEE10 0AGRACH1S) (KARLA18 ZERDIA12) (FAILED22 FAILED16) 

W*i«ht: 22.363718 



Analogical Match from Karia to Zerdia the eagi* (MAS). 
InU Filo: aaUogy rnlN Rubor of HUtck Bypottotot: 47 Xoabor of GMapo: 1 

(PROMISE-HUITn PROMISE) (DESIRE-FEATHERS DESIRE-FEATHERS) (IAXE-FEATEEIS TAKE-FEATHERS) 
(CAUSE-OFFER CAUSE-OFFER) (OFTEE- FEATHERS OJTO-FEATHERS) (EAS-FEATHERS HAS-FEATHE1S) 
(REALIZE -DCS IRE REALIZE-DESIRE) (ATTACK-IUITER ATTACK-SPQRTSMAI) (I0I-AT7ACX IOT-ATTACK) 
(SUCCESS-ATTACK SUCCESS- ATTACK) (FOIXOV-SEE-ATTACK FOLLOW -SEE) (SEE-KARL A SEE-ZERDIA) 
(FAILED- ATTACK SUCCESSFUL- ATTACK) (CAUSE-TAKE CAUSE-TAKE) 

Eaaps: (FAILED22 TRUE11) (KARLA18 ZOOIA7) (HUITER10 SP0RTSMAI8) 
(FEATEERS20 FEAIHES9) (CROSS-BOW 1 CROSS-B0V10) 

Weight: Ifl. 816530 



Figure 15: SME's Analysis of Story Set 5, Using the TA Rules. 



to greater accessibility to the original story for the human subjects than the TA story "Zerdia the 
country" . 

To test the determinants of soundness, SME was again run on the above pairs of stories, this time 
using the analogy (AN) match rules. The output of SME for the TA task is given in Figure 15. Notice 
that "Zerdia the country" (SES = 22.4) was found to be a better analogical match to the original 
Karia story than "Zerdia the eagle" (SES = 16.8). Thus, according to Gentner's systematicity 
principle, it should be judged more sound by human subjects. 

The comparison of SME with human performance have been promising. For each of the nve 
story sets examined, the preferences of SME qualitatively agreed with those of the h iman subjects. 
First, SME replicates human access patterns when in mere-appearance mode (MA rules). SJE's 
rankings of surface similarity between the base and target are strongly correlated with accessibility 
of comparisons in people. Second, psycholog.^al evidence indicates that pcjple do indeed use 
systematicity aud consistency to rate the soundness of a match. As pre^cted, SME replicates 
subject's soundness rankings when in analogy mode (AN rules). These results help confirm that 
the similarity matches programmed into SME's AN and MA rules do a reasonable job >f capturing 
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Table 1: SME performance on described examples. 



Example 


Niimber base 
facts/entitles 


Number target 
facts/entities 


#MH's 


# Gmaps 


Total BMS 
run time 


Total match 
run time 


Simple Water-Heat 


"/« 1 


6/4 


14 


3 


0.70 


0.23 


Solar System-Atom 


12/2 


9/2 


16 


3 


0.91 


0.28 


PHXHEAS behavioral 


40/8 


27/6 


69 


6 


9.68 


1.92 


PHXIEAS theory 


19/11 


13/6 


10 




0.17 


';.66 


Base5-TA5 (AN) 


26/6 


24/6 


54 




5.34 


0.87 


Baee5-MA5 (AN) 


26/6 


24/5 


47 




4.55 


0.98 


Base5-TA5 (MA) 


26/6 


24/6 


12 




0.38 


0.36 


Baw5-MA5 (MA) 


26/6 


24/5 


14 




0.73 


0.46 



VOTE: All times are given in secondi. Total match time 5 total SMS run time minus BMS run time. 



two different kinds of similarity matches that peopie implicitly use. 

We make two additional observations. First, the results demonstrate the considerable leverage 
for cognitive modeling that SME's architecture provides. We know of no other general-purpose 
matcher which successfully simulates two distinct kinds of human similarity comparisons. Second, 
the shot story analogies show that SME is capable of matching large structures as well as the 
smaller, simpler structures shown previously. 

4.5 Review of Performance 

SME is written in Comuon Lisp and the examples in this paper were run on a Symbolics 3640 with 
8 megabytes resident memory. A variety of examples have been presented, representing a wide 
range of complexity. Table 1 shows a comparison of how well SME performed for each example. To 
give an accurate account of how fast SME is, we have separated the BMS run time from the total 
SME run time, since the computational cost of the BMS can be removed if additional inspectability 
is not needed. The run times are given in seconds. From these times it can be seen that SME is 
extremely fast at producing unevaluated Gmaps. In fact, it would seem to be close to linear in 
the number of match hypotheses and in the number of base and target facts. The majority of the 
run time is spent within the BMS, producing structural evaluation scores. However, the total run 
times are sufficiently short that we have opted to continue using the BMS for the time being, since 
it has proven to be a valuable analysis tool. 

The longest runtime occurred for the behavioral match between the water flow and heat Sow 
observations (PHINEAS behavioral). While the descriptions for this example were the largest, the 
primary source of slowdown was the flat representations used to describe the situations. 

5 Comparison With Other Work 

The Structure-Mapping theory has received a great deal of convergent theoretical support in artifi- 
cial intelligence and psychology. Although there are differences in emphasis, there is now widespread 
agreement on the basic elements of cne-to-one mappings of objects with carryover of predicates 
([3,4,34,36,41,42,51,53,66]). Moreover, several of these researchers have adopted special cases of 
the systematicity principle. For example, Carbonell focuses on plans and goals as the high-order 



41 



The Structur&Mapping Engine 



36 



relations that give constraint to a system, while Winston [65] focuses on causality. Structure- 
Mapping theory subsumes these treatments in three ways. First, it defines mapping rules which are 
independent of particular domains or primitives. Second, the Structure-Mapping characterization 
applies across a range of applications of analogy, including problem solving, understanding expla- 
nations, etc. Third, the Structure-Mapping account treats analogy as one of a family of similarity 
comparisons, each with particular psychological privileges, and thus explains more phenomena. 

Some models have combined an explicit Structure-Mapping component to generate potential 
interpretations cf a given analogy with a pragmatic component to select the relevant interpretation 
(e.g., [3,42]). Given our experience with PHINEAS, we believe SME will prove to be a useful tool for 
such systems. 

SME computes a structural match first, and then uses this structural match to derive candidate 
inferences. The implementations of Winston [65,66] and Burstein [3] are similar to SME in this 
respect. An alternate strategy is used by Kedar-Cabelli [42] and Carbonell [4]. These programs 
do not perform a match per but instead attempt to carry over "relevant" structure first and 
modify it until it applies to the target domain* The match arises as an implicit result of the 
structure modification. We know of no complexity results available for this technique, but we 
suspect it is much worse than SME. It appears that there is great potential for extensive search 
in the modification method. Furthermore, the modification method effectively requires that the 
access mechanism is able to provide only salient structures (e.g., purpose-directed [42]), since the 
focusing mechanism of a partial match is not present. This means these systems are unlikely to 
ever derive a surprising result from an analogy. 

A very different approach is taken by Holyotk [39]. In this account, there is no separate 
stage of structural matching. Instead, analogy is completely driven by the goals of the current 
problem-solving context. Retrieval of the base domain is driven by an abstract scheme of current 
problem-solving goals. Creating the mapping is interleaved with othev problem-solving activities. 
This "pragmatic" account, while appeJing in some ways, has several crucial limitations. First, the 
pragmatic model has no account of soundness. Without structural consistency, the search space for 
matching explodes (see below). Second, analogy is used for other purposes than problem solving. 
These purposes include contexts where relevance does not apply. Thus an analog} interpretation 
algorithm that requires relevance cannot be a general solution [26,27]. Analogy can be used to 
explain a new concept and to focus attention on a particular aspect of a situation. Analogy can 
result in noticing commonalities and conclusions that are totally irrelevant to the purpose at hand. 
Third, psychological data indicates that access is driven by surface similarity, not relevance, as 
described previously. 

We believe the modularity imposed by t,he Structure-Mapping account has several desirable 
features over the pragmatic account. In the Structure-Mapping account, the same match procedure 
is used for all applications of analogy. For example, in a problem-solving environment, current plaas 
and goals influence what is accessed. Once base and target axe both present, the analogy mapping 
is performed, independently of the particular context. Its results can then examined and tested as 
part of the problem-solving process (see [26,27]). 

SME demonstrates that an independent, structural matcher can be built which is useful in several 
tasks and for a variety of examples (over 40 at this writing). By contrast, no clear algorithms have 
been presented based on the pragmatic account, and published accounts so far [38] describe only 
two running examples. Another problem concerns potential complexity. We have not yet developed 
a good "typical case" analysis of SME's complexity, but empirical evidence indicates that it is quite 
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reasonable. The reason is that Structure-Mapping focuses on local properties of the representation. 
On the other hand, the pragmatic account appears to involve arbitrary inference, and arbitrary 
amounts of knowledge, h the mapping process. Thus we would expect that the average-case 
computational complexity of a pragmatically oriented matcher will actually be much worse than 
SHE. 

5.1 Matching Algorithms 

To our knowledge, SME is unique in that it generates all structurally consistent analogical mappings 
without search. Previous matchers search through the space of possible matches and often return 
a single, best match (e.g., [11,35,43,46,60,61,64,65,66]). Some research on analogy has concluded 
that generating all possible interpretations is computationally intractable [35,65,43]. Our analysis 
and empirical results indicate that *his conclusion must be substantially modified. Only when 
structural consistency does not exist, or is ignored, does the computation become intractable. For 
instance, in [43] the knowledge base was uniform and had no higher-order structure. In such cases 
exponential explosions are unavoidable. 

Winston's matcher [65,66] heuristically searches for a single best match. It begins by enumer- 
ating all entity pairings and works upward to match relations, thus generating all N^!/(N^-N^)! 
possible entity pairings. Because SHE only introduces entity pairings when suggested by potential 
shared relational structure, it typically generates many fewer entity pairings. Some limited amount 
c' uning due to domain-specific category information was also available on demand, such as re- 
quiring that males match with males. By contrast, SME ignores attributes when in analogy mode, 
unless they play a role in a larger systematic structure. Winston's scoring scheme would attribute 
one point for each shared relation (e.g., LOVE, CAUSE), property (e.g., STRONG, BEAUTIFUL), and 
class classification (e.g., A-KIND-QF(?x, woman)). Unlike SME's analogy rule3, this scheme makes 
no distinction between a single, systematic relational chain and a large collection of independent 
facts. 

Kline's RELAX system [43] focused on matching relations rather than entities. RELAX did 
not attempt to maintain structural consistency, allowing many-to-one mappings between entities 
or predicate instances. In conjunction with a semantic network, RELAX was able to match items 
having quite different syntax (e.g., (Segment Al A2) matching (Angle Al X A2)). However, there 
was no guarantee that the best match would be found due to local pruning during search. 

Programs for forming inductive generalizations have also addressed the partial matching prob- 
lem. These systems use a heuristically pruned search to build up sets of correspondences between 
terms which are then variablized to form generalized concept descriptions. Since these systems were 
not designed for analogy, they resemble the operation of SH5 programmed as a literal graph matcher 
(e.g., they could not match Pressure to Temperature). Hayes-Roth <k McDermott's SPROUTER 
[35] and Diettrich & Michalski's INDUCE 1.2 [11] possess our restriction of one-to-one consistency in 
matching. Vere's THOTH system [60,61] uses less stringent match criteria. Once the initial sets of 
matched terms are built, previously unmatched terms may be added to the match if their constants 
are in related positions. In the process, THOTH may allow many-to-one mappings between terms. 

The usefulness of many-to-one mappings in matches has been discussed in the literature 
[35,43]. Hayes-Roth ic McDermott [35] advocate the need for many-to-one mappings among en- 
tities. Kline [43] calls for many-to-one mappings between propositions as well. For example, 
Kline points out that in trying to match a description of National League baseball to American 
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League baseball, the statement (male NLpitcher) should match both (male ALpitcher) and 
(male ALdeaignatedhitter) . 

We disagree with this view, since we believe that structural consistency is central to analogy. 
Many-to-one mappings are permitted in artistic metaphor, but are not allowed in explanatory, 
predictive analogies [24,32]. However, we agree that multiple mappings are sometimes useful [8]. We 
propose to solve the problem by viewing many-to-one mappings as multiple analogies between the 
same base and target. Since SME produces all of the interpretations of an analogy, a postprocessor 
could keep more than one of them to achieve the beneficial effects of multiple interpretations, 
without sacrificing consistency and structural clarity. Thus, in the baseball example, SME would 
produce an offense interpretation and a defense interpretation. Postprocessing would combine 
these two Gmaps to form a single National League - American League analogy which mapped the 
National League pitcher into both the American League pitcher and American League designated 
hitter. 



6 Discussion 

We have described the Structure-Mapping Engine, a tool-kit for building matchers consistent with 
Gentner's Structure-Mapping theory of analogy and similarity. SME is both efficient and flexible. 
A particular matching algorithm is specified by a set of constructor rules and evidence rules. It 
produces all structurally consistent interpretations of a match, without backtracking. The interpre- 
tations include the candidate inferences suggested by the match and a structural evaluation score, 
giving a rough m tasure of quality. We believe we have described SME's algorithm in sufficient detail 
to allow successful replication by other interested researchers. 

SME has been used both in cognitive simulation studies and a machine learning project. In the 
cognitive simulation studies, the results so far indicate that SME, when guided with analogy rules, 
replicates human performance. In the machine learning project (PHINEAS), SME's flexibility provides 
the means for constructing new qualitative theories to explain observations. 

While our complexity analysis indicates that SME's worst-case performance is factorial, the em- 
pirical experience is that the typical behavior is much better than that. Importantly, the character- 
istic which determines efficiency is not size, but the systematicity of the knowledge being analyzed. 
Unlike many AI systems, the more systematic the knowledge, the better SME will perform. 

In this section we discuss some broader implications of the project, and sketch some of our plans 
for future work. 

6.1 Implications for representation 

The SME algorithm is sensitive to the detailed form of the representation. It must be, since we are 
forbidding domain-specific inference in the matching process. Existing AI systems rarely have more 
than one or two distinct ways to describe any particular situation or theory. But as our programs 
grow more complex (or as we consider modeling the range and depth of human knowledge) the num- 
ber of structurally distinct representations for the same situation is likely to increase. For example, 
a story might be represented at the highest level by a simple classification (i.e., GREEK -TRAGEDY), 
at an intermediate level by relationships involving the major characters (i.e., (CAUSE (MELTING 
WAX) FALL23)), and at the lowest level by something like conceptual dependencies. An engineer's 
knowledge of a calculator might include its functional description, the algorithms it uses, and the 
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axioms of arithmetic expressed in set theory. Unless there is some "window of overlap" between 
the levels of description for base and target, no analogy will be found. When our programs reach 
this complexity, how will SME cope with this situation? 

There are several possible approaches to this problem. Consider the set of possible representa» 
tions for a description. Assume these representations can be ordered (or at least partially ordered) 
in terms of degree of abstraction. If two descriptions are too abstract, there will be no predicate 
overlap (GREEK-TRAGEDY versus SHAKESPEARE-DRAMA) . If two descriptions are greatly detailed, there 
will be too many spurious matches (e.g., describing the actions of characters every microsecond). 
The problem is to find levels of description which provide useful analogies. We believe one solution 
is to invoke SME repeatedly, using knowledge of the definitions of predicates to "slide" the base or 
target descriptions up or down in the space of possible representations appropriately. 

An orthogonal consideration is the degree of systematic: ty. Worst-case behavior can occur 
frequently when representations are large and relatively flat. Changes in representation can make 
large differences. For example, a PHINEAS problem which took .. was reduced to .. by imposing 
more symbolic structure. 

6.2 Addressing the Combinatorics 

As we have shown, SME is 0(N 2 ) except for the last critical merge step, which has 0(N\) worst- 
case performance. Our experience has found that even moderately structural domain descriptions 
produce excellent performance. However, in practice it is not always convenient to avoid traditional, 
fiat domain representations. For example, SME is unable to duplicate Kline's baseball analogy [43] 
within a reasonable amount of tixue (hours). This is due to his fiat description of the domain 
(e.g., (MALE catcher), (BATS lelt-lielder), (BATS center-fielder), etc.). Thus for some 
cases, generating all possible interpretations of an analogy may be prohibitive. Previous work in 
this area has produced matching algorithms that are specifically designed around heuristic search 
mechanisms. SME offers a clean line between generating all possibilities and imposing heuristic 
limitations. If we stop after the first merge step, SME represents an 0(N 2 ) algorithm for generating 
the complete set of minimal Gmaps! The subsequent merge steps could then be heuristically 
driven through a limited search procedure (e.g., beam-search, best-first, etc.) to produce the best 
or N best maximal interpretations. Alternatively, we could retain the current SME design (recall that 
the second merge step is required to support candidate inference generation and is almost always 
0(N 2 ) or better) and simply drop the troublesome third merge step. This is an (unused) option 
that the current implementation provides. We have not yet explored the ramifications of dropping 
merge step 3, although work with PHINEAS has shown the need for the maximality criterion in 
practice. 

Tr* the next sections, we discuss the potential for parallel versions of the SME algorithm. In par- 
ticular, we argue that (1) there are many opportunities for parallel speedup, and (2) the expensive 
merge steps can be eliminated in principle. 

6.2.1 Medium-grained Parallel Architectures 

We begin by examining each stage of the algorithm to see how it might be decomposed into parallel 
operations, and what kinds of speedups might result. 
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Constructing Match Hypotheses Running the match constructor rules is a purely local, and 
can be broken down into one task for each pair of base it em/ tar get item which will either 
produce a hypothesis or not. With enough processors, this step becomes unit time. 

Computing Conflicting, EMaps, and NoGood sets Computing the Conflicting set is a completely 
independent operation. It could either be organized around each base or target item, or 
around pairs of match hypotheses. Finding the EMaps and NoGood sets require propagation 
' of results upwards, and hence must take time proportional to the maximum depth of the the 
match hypotheses. 

Merge Step 1: Form initial combinations Recall that this step starts from the roots of the 
match hypothesis forest, adding the subtree to the list of Gmaps if tlie hypothesis is not 
inconsistent and recursing on its offspring otherwise. The results from each root are inde- 
pendent, and so may be done as separate tasks. If each recursive step spawns a new process 
to handle each offspring, then the minimum time is proportional again to the depth of the 
highest tree in the forest. 

Merge Step 2: Combine dependent but unconnected Gmaps Recall that this step com- 
bines initial Gmaps which share common base structure and are not inconsistent when taken 
together. This procedure can be carried out bottom-up, merging pairs which are consistent 
and then iterating on the results. The computation time will be logarithmic in the number 
of Gmaps, assuming enough processors and ignoring allocation time. 

Merge Step 3: Combine independent collections This can be performed like the previous 
step, but requiring that pairs of Gmaps not have common structure. Again, with enough 
processors the time is bounded by the log of the number of Gmaps. However, since the 
number of Gmaps is in the worst case factorial, the number of tasks required can become 
rather large. 

The analysis above is cursory at best, and there are no doubt several problems lurking in 
creating a highly parallel version of the SME algorithm. However, we believe such algorithms could 
he very promising. 

SMS's simplicity also raises another interesting experimental possibility. Given that currently 
many medium-grain parallel computers are being built with reasonable amounts of RAM and a lisp 
environment on each machine, one can imagine simply loading a copy of SME into each processor. 
Access experiments, for example, would be greatly sped up by allowing a pool of SMEs to work over 
the knowledge base in a distributed fashion. 

6.2.2 Connectionist Architectures 

Another' interesting approach would be to only generate a single, best Gmap while still maintaining 
SME's "no search" policy. The problem of choosing among all possible interpretations in analogy 
processing is very much like choosing among possible interpretations of the sentence "John shot two 
bucks" in natural language processing. A "no search" solution to this natural language problem 
was provided by the connectionist work of Waltz and Pollack [63]. In fact, it was their work in 
connectionist models of natural language processing that inspired certain aspects of the design of 
SME. 
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Connectionist architectures allow all possible answers to be implicitly represented at once 
[16,63]. Rather than explicitly constructing all possible sentence interpretations and then choos- 
ing the best one, Waltz and Pollack ucad their networks to implicitly represent all of the possible 
choices. Given a particular network, spreading activation and lateral inhibition were used to find 
the single best interpretation. 

Consider the network produced by SNE prior to the Gmap merge steps (shown in Figure 5). 
Some match hypotheses support each other (grounding criterion) while others inhibit each other 
( Conflicting relations) . Viewing this as aspreading activation, lateral inhibition network, it appears 
that standard connectionist techniques could be used to produce the best interpretation without 
explicitly generating all Gmaps. Furthermore, it may be possible to generate the second-best, 
third-best, etc. interpretations on demand by inhibiting the nodes of the best interpretation, 
forcing the second best to rise. Thus SME would be able to establish a global interpretation simply 
as an indirect consequence of the establishment of local consistency. Tnis would eliminate the single 
most expensive computation of the SHE algorithm. By eliminating explicit generation of all Gmaps, 
the complexity of the algorithm could drop to the 0(N 2 ) required to generate the connectionist 
network. 

6,3 Future Work 

6.3*1 Cognitive Simulation 

We have several cognitive simulation studies of analogical reasoning, memory, and learning involving 
SME in progress. We mention only one here. Psychological research shows a marked developmental 
shift in analogical processing. Young children rely on surface information in analogical mapping; at 
older ages, systematic mappings are preferred [30,31,40,62]. -Further, there is some evidence that a 
similar shift from surface to systematic mappings occurs in the novice-expert transition in adults 
[7,44,51,52]. 

In both cases there are two very different interpretations for the analogical shift: (1) acquisition 
of knowledge; or (2) a change in the analogy algorithm. The knowledge-based interpretation is 
that children and novices lack the necessary higher-order relational structures to guide their analo- 
gizing. The second explanation is that the algorithm for analogical mapping changes, either due 
to maturation or learning. In human learning it is difficult to decide this issue, since exposure to 
domain knowledge and practice in analogy and reasoning tend to occur simultaneously. SME gives 
us a unique opportunity to vary independent!*' the analogy algorithm and the amount and kind 
of domain knowledge. For example, we can compare identical evaluation algorithms operating on 
novice versus expert representations, or we can compare different analogy evaluation rules operat- 
ing on the same representation. The performance of SME under these conditions can be compared 
with novice versus expert human performance. 

We are also exploring ways to reduce the potential for tailorability in the process of translating 
description? provided as experimental stimuli for human subjects into formal representations for SME 
input. For example, Janice Skorstad is creating a graphical editor for producing graphical figures for 
experimental stimuli. One output of the editor is a printable picture, the other is a set of symbolic 
assertions with numerical parameters. The assertions are then passed into a simple inference engine 
(using a logic-based truth-maintenance system) which creates the relational structure SME requires 
by using rules to calculate interesting relationships, such as INSIDE or LEFT-OF. 

Inspired by Winston's use of a pidgin- English parser for input [66], we are also seeking i 
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parser that, perhaps in coiyunction with a simple inference engine, can produce useful descriptions 
of stories. Unfortunately, the parsers we have examined so far require input that is sufficiently 
restrictive that our subjects may find the stories unnatural. 

6.3.2 Machine Learning Studies 

While Falkenhainer's PH1NEAS program is already capable of some learning behaviors, there is still 
more work to do. He is currently adding to its armatorium of knowledge-refinement techniques, 
and investigating access techniques appropriate for implementation on serial computers. He is also 
running PHINEAS on a number of additional examples to better test his theories. 

PHINEAS is part of a larger machine learning project underway at University of Illinois, called 
the Automated Physicist Project This project, spearheaded by Forbus and Gerald DeJong, also 
at Illinois, is building a collection of programs that use qualitative and quantitative techniques 
for reasoning and learning about the physical world. DeJong and his students have already built 
several interesting programs that use Explanation-Based Learning [9,10] to acquire knowledge of 
the physical world [55,49]. Forbus' group has already developed a number of useful qualitative 
reasoning programs [20,21,37] which can be used in learning projects (as PHINEAS demonstrates). 
By combining these results, we hope to build systems that can reason about a wide range of physical 
phenomena and learn both from observation and by being taught. 
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A SME Match Rules 

The construction of a match is guided by a set jf match rules that specify which facts and entities 
in the base and target might match and estimate the beiievability of each possible component of a 
match. In our experiments using SME, we currently use three types of rule sets, literal similarity, 
analogy, and mere appearance. 

A.1 Literal Similarity (LS) Rules 

The literal similarity rules look at both relations and object descriptions. 

;;;; Define MH constrvctor rules 

;; If predicates are the samt, match them 

(MHC-rola (:filtar Tb ?t :tast (aq (f act-functor ?b) (f act-functor ?t))) 
(inatall-MH Tb ?t)) 

;; Intern rule for non-commutaUvt predicates - corresponding arguments only. 
;; Match compatible argument* of already matched items 

(MHC-rula (:intarn Tb Tt :tast (and (factT Tb) (factT Tt) 

(not (comntatiYaT (fact -functor Tb))) 
(not (coBBtttatiTtT (f act-functor ?t))))) 
(do ((bchildran tfact-argu»anta Tb) (cdr bchildran)) 
(tchlldran (fact-argunanta Tt) (cdr tchlldran))) 
((or (null bchlldran) (null tchlldran))) 
(cond ((and (antityT (first bchlldran)) (antityT (first tchildren))) 
(install-HH (first bchildran) (first tchlldran))) 
((and (functlonT (fact-functor (first bchildran))) 
(function? (fact-functor (first tchlldran)))) 
(install-HH (first bchildran) (first tchlldran)))))) 

;; Intern rule for commutative predicates - any "compatible* arguments, regardless of order. 
;; Match compatible arguments of already matched items 

(MHC-rula (:intarn ?b ?t :tast (and (factT Tb) (fact? ?t) 

(coaautatlTaT (fact-functor Tb)) 
(comutatiTaT (f act-furttor ?t)))) 
(dollst (bchild (fact-argunants Tb)) 
(dollst (tchild (fact-arguaanta Tt)) 
(cond ((and (antityT bchild) (antityT tchild)) 
(lnstall-MH bchild tchild)) 

((and (functionT (fact-functor bchild)) (function? (fact-functor tchild))) 
(install-MH bchild tchild)))))) 

;;;; Define MH evidence rules 

;; having the same functor is a good sign 

n 

(assart! aana-functor) 

(rula ((:intarn (MH ?b ?t) :ttst (and (fact? ?b) (fact? *t) 

(tq (facc-functor ?b) (fact-functor 7 t))))) 

(if (function? (fact-functor ?b)) 

(assart! (iaplits saaa-f unctor (MH ?b Tt) (0.2 . 0.0))) 
tasstrt! (iaplits saaa-f unctor (MH ?b ?t) (0.6 0.0))))) 
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;;chtck children (arguments) matth potential 

99 

(inltial-atfsartion (assert! 'children -potential)) 

(rule ((:intern (MH ?b ?t) :t*st (tad (fact? ?b) (fact7 ?t)))) 
(if (children-aatch-potential Tb ?t) 

(assert! (implies children-potential (MR 7b 7t) (0.4 . 0.0))) 
(assart) (implies children-potential (MH Tb ?t) (0.0 . 0.8))))) 

,71/ their order is similar, this is good. If the item is a function, 
;; ignore since order comparisons give false support here, 
tt 

(initial-assertion (assart! 'order-similarity)) 

(rula ((:intern (MH ?b ?t) :test (and (fact7 ?b) (fact7 7t) 

(not (function7 (fact-functor 7b))) 
(not (function? (fact-functor ?t)))))) 
(cond ((- (fact -order Tb) (2act-order 7t)) 

(assart! (Uplias order-similarity (MH 7b ?t) (0.3 . 0.0)))) 
((or (• (fact-order 7b) (!♦ (f act-ordar ?t))) 
(» (f act-ordar Tb) (1- (f act-ordar Tt)))) 
(assart! (implies order-similarity (MB ?b ?t) (0.2 . 0.05)))))) 

;,yropagate evidence down - systematicity 

;; support for the org will be 0.8 of the current support for the parent 

it 

(rula ((:intarn (MH ?bl ?tl) :tast (and (factT Tbl) (factT TtX))) 

(:intarn (MH 7b2 7t2) :tast (corresponding-arguments? 7b2 7t2 ?bl ?ti))) 
(assart! (iaplias (MH ?bX ?tl) (MH 7b2 7t2) (0.8 . 0.0)))) 

;;;; Gmap rules 

,7 Support from its MH's. At this time we ignore other factors such as number 
;; of candidate inferences, etc. 

(rula ((: intern (GMAP ?ag))) 

(dollst (mh (gm-elements ?eg)) 

(assart! '(iaplias . (ah-f or* ah) (GMAP 7 eg) ) )) ) 

A.2 Analogy (AN) Rales 

The analogy rules prefer systems of relations and discriminate against object descriptions. The 
analogy evidence rules are identical to the literal similarity evidence rules. The match constructor 
rules only differ in their check for attributes: 

,7,7 Define MH constructor rules 

,7 // predicates are the same, match them 

(MHC-rule (:filter ?b ?t .test (and (aq (fact-functor ?b) (fact-functor 7 t)) 

(not (attribute? (fact-functor *b))))) 

(install -MH ?b ?t)) 
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;; Match compatible arguments of already matched items* 

;; Notice attributes are allowed to match here, since they are part of some higher relation that matched. 
;; Intern rule for non- commutative predicates - corresponding arguments only. 

(MHC-rula (:intern 7b ?t :teat (and (fact? ?b) (fact? ?t) 

(not (commutative? (fact-functor Tb))) 
(not (commutative? (fact-functor ?t))))) 
(do ( (bchlldren (f act-argumenta Tb) (cdr ©children)) 
(tcnildran (f act -argument • ?t) (cdr tchildren))) 
((or (null bcnlldran) (null tchildren))) 
(cond ((and (entity? (flrat bcnlldran)) (entity? (flrat tcnildran))) 
(lnatall-MH (flrat bcnlldran) (flrat tcnildran))) 
((and (function? (fact-functor (flrat bcnlldran))) 
(function? (fact -functor (flrat tcnildran)))) 
(inatell-MH (flrat bcnlldran) (flrat tcnildran))) 
((and (attrlbuta? (fact-functor (flrat bcnlldran))) 

(aq (fact-functor (flrat bcnlldran)) (fact-functor (flrat tcnildran)))) 
(lnatall-MH (flrat bcnlldran) (flrat tcnildran)))))) 

;; Intern rule for commutative predicates - any 'compatible 9 arguments, not necessarily corresponding. 

(MHC-rula (:intam 7b ?t rtaat (and (fact? 7b) (fact? ?t) 

(commutative? (fact-functor ?b)) 
(commutative? (fact-functor ?t)))) 
(dollat (bcnlld (fact-argunanta ?b)) 
(dollat (tcnlld (fact -arguments ?t)) 
(cond ((and (entity? bcnlld) (entity? tcnlld)) 
(ln«tall-MH bcMld tcnlld)) 
((and (function? (fact-functor bcnlld)) 
(function? (fact -functor tcnlld))) 
(inetall-MH bcnlld tcnlld)) 
((and (attrlbuta? (fact-functor bcnlld)) 

(aq (fact -functor bcnlld) (f act-f jnctor tcnlld))) 
(lnatall-MH bcnlld tcnlld)))))) 

A. 3 Mere Appearance (MA) Rules 

The mere appearance rules focus on object descriptions and prevent matches between functions or 
relations. As a result, the number of evidence rules is greatly reduced. 

;;;; Define MH constructor rules 

(MHC-rula (:filtar ?b ?t :taat (and (aq (fact-functor ?b) (fact-functor ?t)) 

(<■ (fact-order ?b) i) 
(<« (fact-order ?t) 1))) 

(lnatall-MH ?b ?t)) 

(MHC-rula (:inte^ ^ °t :teat (and (tact? ?b) (fact? ?t) 

(not (commutative? (fact-functor ?b))) 
(not (commutative? (fact -functor ?t))))) 
(do ((bcnlldran (f act-argumenta ?b) (cdr ©children)) 
(tcnildran (f act-argumenta ?t) (cdr tcnildran))) 
((or (null bchlldren) (null tcnildran))) 
(if (and (entity? (first bchlldren)) (entity? (flrat tchildren))) 
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(in*tall-HH (fjrit b children) (first tchlldren))))) 

(MHC-rule Ointern ?b ?t :tett (and (fact? ?b) (fact? ?t) 

(commutatire? (fact-functor ?b)) 
(commutative? (fact-functor ?t)))) 
(dolitt (bchild (fact-arguments ?b)) 
(doliat (tchild (f act-argumenti ?t)) 
(if (and (entity? bchild) (antity? tchild)) 
(instali-MH tchild tchild))))) 

;;;; Define MH evidence rules 

;; having the same functor is a good sign 

if 

(initial-assertion (assart! *s am* -functor)) 

(rola ((:intarn (MH ?b ?t) rtast (and (fact? ?b) (fact? ?t) 

(tq (fact-functor ?b) (fact -functor ?t))))) 
(cond ((attribute? (fact -functor ?b)) 

(assart! (implies same-functor (MH 7b ?t) (0.5 . 0.0)))) 
((■ 1 (max (fact-ordar ?b) (fact-order ?t))) 
(assert! (implies same-functor (MH ?b ?t) (0.4 . 0.0)))))) 

Wropagrte evidence down - only for entity MH's caused by attribute pairings 
;; support for the arg will be 0.9 of the current support for the parent 

» 

(rule (dintem (MH ?M ?ti) :test (and (fact? ?bl) (fact? ?tl) 

(<■ (max (fact-oruer ?bl) (fact -order ?tl)) 1))) 
Ointern (MH ?b2 ?t2) :test (corresponding- arguments? ?b2 ?t2 ?bl ?ti))) 
(assert! (implies (MH ?bl ?tl) (MH ?o2 ?t2) (0.9 . 0.0)))) 

;;;; Gmap rules 

;;; Support from its MH's. At this time we ignore other factors such as number of candidate inferences 

(rule (dintern (GMAP ?eg))) 

(dolist (ah (ga-elements ?eg)) 

(assert! '(implies .(mh-formmh) (GMAP ?eg))))) 

B Sample Domain Descriptions 

In this section we show the domain descriptions given to SME for the described examples. 

B.l Simple Water Flow - Heat Flow 

Water Flow 

(def Entity water :type inanimate) 
(def Entity beaker :type inanimate) 
(def Entity vial :type inanimate) 
(def Entity pipe . type inanimate) 
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(defDeecription simple-water-flow 
entities (water beaker rial pipe) 
facts (((flow beaker rial water pipe) inane wflov) 
((preaaure beaker) inane pressure -beaker) 
((preaaure Tlal) inane pressure -rial) 
((graatar pressure-beaker preaaure-rial) inane >prassura) 
((greater (diameter beaker) (diameter Tial)) inane >dia*eter) 
((cause >pr«asura wf low) inane cause-flow) 
(flat-top water) 
(liquid water))) 



(def Entity eqffee itype inanimate) 
(def Entity ice-cube :type inanimate) 
(def Entity bar :tyoe inanimate) 
(def Entity heat :type inanimate) 

(def Description simple-heat-flow 

entities (coffee ice-cube bar heat) 
facta (((flow coffee ice-cube heat bar) :name hflow) 
((temperature coffee) mane temp-coffee) 
((temperature ice-cube) inane temp-ice-cube) 
((greater temp-coffee temp-ice-cube) :name >tenperature) 
(flat-top coffee) 
(liquid coffee))) 



(def Entity aun :type inanimate) 
(def Entity planet :type inanimate) 

(def Description aolar-ayatem 
antitiea (aun planet) 
facta (((masa aun) :name naaa-aun) 

((maaa planet) inane naaa-planet) 

((greater maaa-aun mass-planet) inane >naaa) 

((attracts aun planet) inane attracts) 

( (revolYe-around planet sun) :same revolve) 

((and >aaas attracts) inane andl) 

((cause andl revolve) inane cause -revolve) 

((temperature sun) inane tenp-sun) 

((temperature planet) inane tenp-planet) 

((greater temp-sun temp-planet) inane >teap) 

((gravity nass-sun mass-planet) inane force-gravity) 

((cause force-gravity attracts) mama shy-attracts))) 
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B.2 Solar-System - Rutherford Atom 



Solar System 



Rutherford Atom 



(def Entity 
(def Entity 



nucleus itype inanimate) 
electron itype inanimate) 
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(def Description rutherf ord-ato* 
entities (nucleus electron) 
facts (((mass nucleus) :name mass-n) 
((mass electron) :name iui-i) 
((greater nass-n mass-t) :neae >mass) 
((attracts nucleus electron) :name attracts) 
((rerolYe-around electron nucleus) :name reTolve) 
((charge electron) ;name q-electron) 
((charge nucleus) :name q-nudeus) 
((opposite-sign q-nucleus q-electron) :naae > charge) 
((cause >charge attracts) :name why-attracts))) 



B.3 Karla Stories 



(def Entity Karla) 
(del Entity hunter) 
(def Entity feathers) 
(def Entity cross-bow) 
(def Entity Failed) 
(def Entity high) 



Zerdia the eagle - base story 



(def Description base-6 

entities (Karla hunter feathers cross-bow Failed high) 
facts (((bird Karla) :name bird-Karla) 

[(person hunter) : name person-hunter) 
[(warlike hunter) :name warlike-hunter) 

[(Karlas-asset feathers) :name feathers-asset) 
[(weapon cross-bow) :name weapon-bow) 

[(used-for feathers cross-bow ) :nome has-f eathers) 
[(not has-f eathers) :naae not-has-f eathers) 

[(attack hunter Karla) :name attack-hunter) 
[(not attack-hunter) :name not-attack) 
[(see Karla hunter) :name see-Karla) 

[(follow see-Karla attack-hunter) :name follow-aee-attack) 
[(success attack-hunter) :name success -attack) 
[(equals success-attack Failed) :name failed- at tack) 

[(cause not-has' feathers failed-attack) :naae cause-f ailed-attack) 
[(desire hunter feathers) :name desire-feathers) 

[(realize Karla desire-feathers) :name realize-desire) 
[(follow failed-attack realize-desire) :name follow-realize) 
[(offer Karla feathers hunter) :name offer-feathers) 
[(cause realize-desire offer-feathers) :name cause-offer) 
[(obtain hunter feathers) :name take-feathers) 
[(cause offer-feathers take-feathers) :name cause-take) 
[(happiness hunter) :name happiness-hunter) 

[(equals happiness-hunter high) :name happy-hunter) 
[(cause take-feathers happy-hunter) :name cause-happy) 

[(promise hunter Karla not-attack) :name promise-hunter) 

[(cause happy-hunter promise-hunter) :nase cause-promise))) 



Zerdia the country - TA5 



(def Entity Zerdia) 



58 



REFERENCES 



(daf Entity Gagrach) 
(dafEntity auparcoaputar) 
(dafEntity aiaailaa) 
(dafEntity failed) 
(daf Entity high) 

(dafDaacription ta-6 

antitiaa (Zardia Gagrach auparcoaputar aiaailaa f tiltd high) 
facts (((country Zardia) rnaaa country-Zardia) 

((country Gagrach) rnaaa country-Gagrach) 

((warlike Gagrach) rnaaa warlika-Gagrach) 

((Zardias-aaaat auparcoaputar) rnaaa auparcoaputar-aaaat) 
((wsapon aiaailaa) rnaaa waapon-bow) 

((usad-for suparcoaputar aiaailaa ) rnaaa uaa-auparcoaputar) 
((not uaa-auparcoaputar) :naaa not -uaa-auparcoaputar) 
((attack Gagrach Zardia) rnaaa attack-Gagrach) 
((not attack-Gagrach) rnaaa not-attack) 
((auccaaa attack-Gagrach) rnaaa sue caa a -attack) 

((aquala succaaa -attack failad) rnaaa f ailad-attack) 

((cauaa not -uaa-aur*rcoapnt ar f ailad-attack) rnaaa causa-f ailad-attack) 
((daaira Gagrach suparcoaputar) rnaaa daaira-auparcoaputar) 
((raallaa Zardia daaira-auparcoaputar) rnaaa raaliz a- daaira) 
((follow f ailad-attack raaliza-daaira) rnaaa f ollow-raaliza) 
((off ar Zardia auparcoaputar Gagrach) rnaaa off ar-auparcoaputar) 
((cauaa raaliza-dasira off ar-auparcoaputar) rnaaa cauaa-offar) 
((obtain Gagrach suparcoaputar) rnaaa buy-auparcoaputar) 
((cauaa off ar-*uparcoaputar buy-auparcoaputar) :nama causa-buy) 
((happinaaa Gagrach) rnaaa happinaaa -Gagrach) 
((aquala happinaaa -Gagrach high) :nama happy-Gagrach) 
((cauaa buy-auparcoaputar happy -Gagrach) :naaa cauaa -happy) 
((proaisa Gagrach Zardia not-att&ck) :naaa proaiaa) 
((cauaa happy-Gagrach proaiaa) :naaa cauaa-proaica))) 

Zerdia the hawk - MAS 

(daf Entity Zardia) 
(daf Entity aportaaan) 
(daf Entity f a at hara) 
(daf Entity croaa-bow) 
(def Entity trua) 

(dafDaacription aa-6 

antitiaa (Zardia aportaaan f aathara croaa-bow trua) 

facta (((bird Zardia) rnaaa bird-Zardia) 

((parson aportaaan) rnaaa para on- aportaaan) 

((warlika aportaaan) rnaaa war lika- aportaaan) 

((Zardiaa-aaaat faathara) rnaaa faathara-aaaat) 
((waapon croaa-bow) rnaaa waapon-bow) . 

((uaad-for faathara cro^a-bow ) rnaaa haa-f aathara) 
((daaira aportaaan faathara) rnaaa daaira-f aathara) 
((raaliza Zardia daaira-f aathara) rnaaa raaliza-deaira) 

((offar Zardia faathara aportaaan) rnaaa off ar-f aathara) 
((cauaa raaliza-ilaaira off ar-f aathara) rnaaa cauaa-offar) 
((obtain aportaam faathara) rnaaa taka-f aathars) 

((cauaa off ar-f aathars taka-f aathara) rnaaa cauaa- take) 
((attack sportsaau Zardia) rnaaa attack-sportaaan) 
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(not attack-sportiaan) :naae not-attack) 

(promise sportsman Zardia not-attack) :naxe promise) 

(cause take-feathers promise) :nui cauae-promiie) 

(aaa Zardia aportsmta) :name saa-Zardia) 

(follow promise aee-Zerdia) :naae f ollow-promise) 

(follow see-Zerdia attack-sportsman) :name follow-see) 

(success attack-eportaman) :name iuc cm -attack) 

(equala success -attack true) :name successful- at tack) 

(causa haa-f tathora successful-attack) -.name cauee-auccess-attack) 

(realize Zardia hae-f eathera) :name realize-Zerdia) 

(follow successful-attack realize -Zardia) :name f ollow-succ-attack))) 
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